A Synchronous Processing Method for Underwater Sound Source Recognition and Range Estimation Based on Multi-Task Learning and Transfer Learning

This study focuses on underwater sound sources and investigates the application of multi-task learning (MTL) in the synchronous processing of target recognition and range estimation to achieve systematic detection. To enhance the applicability of the MTL model with limited underwater acoustic data,...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Circuits, systems, and signal processing Ročník 44; číslo 9; s. 6845 - 6872
Hlavní autoři: Wang, Yong, Yao, Qihai, Yang, Yixin
Médium: Journal Article
Jazyk:angličtina
Vydáno: New York Springer US 01.09.2025
Springer Nature B.V
Témata:
ISSN:0278-081X, 1531-5878
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:This study focuses on underwater sound sources and investigates the application of multi-task learning (MTL) in the synchronous processing of target recognition and range estimation to achieve systematic detection. To enhance the applicability of the MTL model with limited underwater acoustic data, this study integrates transfer learning (TL) with MTL, forming the MTL-TL model. The proposed approach extracts input features of underwater acoustic targets, constructs an MTL model, pre-trains it using a large dataset from a pre-selected marine area, retrains it with a small dataset from the detected sea area based on TL, and performs target recognition and range estimation on test sets. The effectiveness of the proposed MTL-TL method is validated using data from the S5 voyage of SWellEX-96 experiment and is compared with three models: single-task learning (STL), STL-TL, and MTL. The results indicate that the combination of the regularization effect between subtasks of MTL and the empirical information advantage of TL enables the MTL-TL model to achieve accurate target recognition and range estimation, outperforming other models. Furthermore, the model remains effective in broadband target scenarios and cases involving two sound sources of the same frequency. Even at low signal-to-noise ratios, the model remains applicable, achieving the recognition accuracy above 79% and mean absolute percentage error below 8% at 0 dB.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:0278-081X
1531-5878
DOI:10.1007/s00034-025-03127-4