Novel task decomposed multi-agent twin delayed deep deterministic policy gradient algorithm for multi-UAV autonomous path planning

•This study develops a novel TD-MATD3 model to improve convergence efficiency.•A novel reward function is also designed to facilitate the convergence of the algorithm.•The TD-MATD3 can obtain superior performance in complex dynamic environments. Path planning is one of the most essential parts of ta...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	Knowledge-based systems Ročník 287; s. 111462
Hlavní autoři:	Zhou, Yatong, Kong, Xiaoran, Lin, Kuo-Ping, Liu, Liangyu
Médium:	Journal Article
Jazyk:	angličtina
Vydáno:	Elsevier B.V 05.03.2024
Témata:	Decomposed Actor-Critic network DRL MATD3 Multiple UAVs Path planning Path planning MATD3 Decomposed Actor-Critic network Multiple UAVs DRL
ISSN:	0950-7051, 1872-7409
On-line přístup:	Získat plný text
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Buďte první, kdo okomentuje tento záznam!