LABANet: A Lightweight Asymmetrical Bottleneck and Attention-Based Network for Cloud Detection

Recently, onboard satellite cloud detection has attracted extensive interest due to its ability to reduce invalid data and save satellite storage and downlink resources. As the demand for accuracy has increased, deep neural networks with encoder-decoder architectures have been widely used for cloud...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:IEEE sensors journal Ročník 24; číslo 4; s. 4771 - 4785
Hlavní autori: Yu, Ximing, Peng, Yu, Shao, Wenyi, Liu, Liansheng, Sun, Kaipeng
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: New York IEEE 15.02.2024
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Predmet:
ISSN:1530-437X, 1558-1748
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:Recently, onboard satellite cloud detection has attracted extensive interest due to its ability to reduce invalid data and save satellite storage and downlink resources. As the demand for accuracy has increased, deep neural networks with encoder-decoder architectures have been widely used for cloud detection. Most networks mainly focus on extracting contextual features and fusing more information derived from multilevel features to achieve high accuracy. Convolutional layers with contextual feature extraction abilities in networks, however, always contain many computations and parameters, making them difficult to deploy in processors with limited computing and memory resources. Additionally, fusing features acquired from different levels easily introduces redundant noise and decreases accuracy. To address these issues, a novel lightweight asymmetrical bottleneck and attention-based network (LABANet) is proposed. In the encoder, a multiscale context-based asymmetrical bottleneck (MCAB) is designed to extract more contextual features in an efficient manner. The number of computations and parameters required by the bottleneck can be effectively reduced via asymmetric channel compression and grouping for different pointwise convolutions. Multiscale contextual features can be extracted by dilated depthwise asymmetric convolution. In the decoder, a spatial-channel cascade attention (SCCA) module is proposed to fuse multilevel features with less redundant noise by highlighting spatial and channel correlation information. Experimental results obtained on public datasets demonstrate that the proposed LABANet can exceed 95% overall accuracy (OA) with a computational cost of 1.20 giga-floating-point operations (GFLOPs) and a parameter count of <inline-formula> <tex-math notation="LaTeX">1.2\times 10^{{6}} </tex-math></inline-formula>; thus, it achieves competitive accuracy with fewer computations and parameters than other baseline cloud detection networks.
Bibliografia:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1530-437X
1558-1748
DOI:10.1109/JSEN.2023.3345386