Predictive Obstacle Avoidance Algorithm for Under‐Actuated Unmanned Surface Vehicle Under Disturbances via Reinforcement Learning

ABSTRACT Due to the growing complexity of diverse maritime tasks, underactuated unmanned surface vehicle (USV) has become a research hotspot. The rapid development of deep reinforcement learning (DRL) technology has brought forth a novel approach for the USV autonomous control, rendering unnecessary...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	Journal of field robotics Ročník 42; číslo 7; s. 3482 - 3499
Hlavní autoři:	Jin, Kefan, Liu, Zhe, Wang, Jian
Médium:	Journal Article
Jazyk:	angličtina
Vydáno:	Hoboken Wiley Subscription Services, Inc 01.10.2025
Témata:	Collision avoidance Deep learning deep reinforcement learning (DRL) Disturbances Dynamic models Modules Obstacle avoidance Predictions state prediction Surface vehicles Task complexity unmanned surface vehicle (USV) control Unmanned vehicles
ISSN:	1556-4959, 1556-4967
On-line přístup:	Získat plný text
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Popis
Shrnutí:	ABSTRACT Due to the growing complexity of diverse maritime tasks, underactuated unmanned surface vehicle (USV) has become a research hotspot. The rapid development of deep reinforcement learning (DRL) technology has brought forth a novel approach for the USV autonomous control, rendering unnecessary the dynamical modeling of the target USV. To further improve the USV collision avoidance performance against maritime disturbances, this paper presents a predictive reinforcement learning method for USV obstacle avoidance control. A prediction module is designed to generate latent features that depict environmental states. After that, the prediction feature is provided for a DRL‐based policy module to produce an action distribution for the underactuated unmanned surface vehicle. The proposed method in this paper can enable the USV avoid obstacle and reach the destination solely based on its local observational information, without relying on prior global information. Simulation and physical experiments have demonstrated that, compared to general DRL methods, the proposed method exhibits stronger robustness to environmental disturbances, enabling the USV to reach the destination while avoid the obstacle.
Bibliografie:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	1556-4959 1556-4967
DOI:	10.1002/rob.22554