Predictive Obstacle Avoidance Algorithm for Under‐Actuated Unmanned Surface Vehicle Under Disturbances via Reinforcement Learning

ABSTRACT Due to the growing complexity of diverse maritime tasks, underactuated unmanned surface vehicle (USV) has become a research hotspot. The rapid development of deep reinforcement learning (DRL) technology has brought forth a novel approach for the USV autonomous control, rendering unnecessary...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:Journal of field robotics Ročník 42; číslo 7; s. 3482 - 3499
Hlavní autori: Jin, Kefan, Liu, Zhe, Wang, Jian
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: Hoboken Wiley Subscription Services, Inc 01.10.2025
Predmet:
ISSN:1556-4959, 1556-4967
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Popis
Shrnutí:ABSTRACT Due to the growing complexity of diverse maritime tasks, underactuated unmanned surface vehicle (USV) has become a research hotspot. The rapid development of deep reinforcement learning (DRL) technology has brought forth a novel approach for the USV autonomous control, rendering unnecessary the dynamical modeling of the target USV. To further improve the USV collision avoidance performance against maritime disturbances, this paper presents a predictive reinforcement learning method for USV obstacle avoidance control. A prediction module is designed to generate latent features that depict environmental states. After that, the prediction feature is provided for a DRL‐based policy module to produce an action distribution for the underactuated unmanned surface vehicle. The proposed method in this paper can enable the USV avoid obstacle and reach the destination solely based on its local observational information, without relying on prior global information. Simulation and physical experiments have demonstrated that, compared to general DRL methods, the proposed method exhibits stronger robustness to environmental disturbances, enabling the USV to reach the destination while avoid the obstacle.
Bibliografia:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1556-4959
1556-4967
DOI:10.1002/rob.22554