WSA-YOLOv5s: improved YOLOv5s based on window self-attention module for ship detection.

Uloženo v:
Podrobná bibliografie
Název: WSA-YOLOv5s: improved YOLOv5s based on window self-attention module for ship detection.
Autoři: Zhou, Weina, Wang, Hong, Wu, Xintao
Zdroj: Pattern Analysis & Applications; Dec2024, Vol. 27 Issue 4, p1-12, 12p
Abstrakt: Ship detection from visual image (SDVI) plays a significant role in terminal management, cross-border ship detection and marine target tracking. Compared to synthetic aperture radar (SDSAR) based ship detection, SDVI has superior performance on accuracy and speed. WSA-YOLOv5s, a new algorithm with a Window Self-Attention (WSA) module is proposed in this paper to substantially improve the ability of detecting small targets while enhancing the ability of large target recognition marginally. The following fine tunings are made based on the original YOLOv5s network. (1) Perform target data enhancement for the dataset’s imbalanced classification by using the improved mix-up algorithm and focal loss; (2) Replace CSP structure in the original backbone with the WSA module, and add one more interaction layer than the original to increase the exchange of high-level semantic information; (3) Add CBAM module to reduce the influence of background interference factors by using spatial attention and channel attention. Experiments are carried out on the Singapore Maritime Dataset (SMD). The results show that our model is greatly improved in all aspects. And the mAP increases from 73 to 90.5%, while keeping the number of model parameters small. [ABSTRACT FROM AUTHOR]
Copyright of Pattern Analysis & Applications is the property of Springer Nature and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Databáze: Complementary Index
Popis
Abstrakt:Ship detection from visual image (SDVI) plays a significant role in terminal management, cross-border ship detection and marine target tracking. Compared to synthetic aperture radar (SDSAR) based ship detection, SDVI has superior performance on accuracy and speed. WSA-YOLOv5s, a new algorithm with a Window Self-Attention (WSA) module is proposed in this paper to substantially improve the ability of detecting small targets while enhancing the ability of large target recognition marginally. The following fine tunings are made based on the original YOLOv5s network. (1) Perform target data enhancement for the dataset’s imbalanced classification by using the improved mix-up algorithm and focal loss; (2) Replace CSP structure in the original backbone with the WSA module, and add one more interaction layer than the original to increase the exchange of high-level semantic information; (3) Add CBAM module to reduce the influence of background interference factors by using spatial attention and channel attention. Experiments are carried out on the Singapore Maritime Dataset (SMD). The results show that our model is greatly improved in all aspects. And the mAP increases from 73 to 90.5%, while keeping the number of model parameters small. [ABSTRACT FROM AUTHOR]
ISSN:14337541
DOI:10.1007/s10044-024-01333-5