Multi-UAV Autonomous Obstacle Avoidance Based on Reinforcement Learning

Obstacle avoidance is a necessary behaviour to ensure the safety of UAV. In this paper, aiming at the problem of autonomous learning and obstacle avoidance of multiple UAVs in the multi-obstacle map environment, an obstacle avoidance method of UAVs based on an improved reward deep Q learning network...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Chinese Control Conference S. 8657 - 8661
Hauptverfasser: Li, Zheng, Li, Jinna, Wang, Yanhui
Format: Tagungsbericht
Sprache:Englisch
Veröffentlicht: Technical Committee on Control Theory, Chinese Association of Automation 24.07.2023
Schlagworte:
ISSN:1934-1768
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Obstacle avoidance is a necessary behaviour to ensure the safety of UAV. In this paper, aiming at the problem of autonomous learning and obstacle avoidance of multiple UAVs in the multi-obstacle map environment, an obstacle avoidance method of UAVs based on an improved reward deep Q learning network is proposed. According to the dynamics model of UAV, the three-dimensional dynamic equation is established, and the combination of pitch angle and heading angle constructs the action space of UAV. a new reward evaluation method, which adaptively adjusts the weight of the reward according to the distance between the UAV and the obstacle, is developed thus improving the performance of the UAV. Thus, the obstacle avoidance performance of multiple UAVs in unknown environment can be effectively improved. Finally, by comparing the traditional deep Q learning network, the simulation results show that the algorithm in this paper is reasonable, and UAVs can successfully achieve obstacle avoidance in unknown environment.
ISSN:1934-1768
DOI:10.23919/CCC58697.2023.10240772