Novel task decomposed multi-agent twin delayed deep deterministic policy gradient algorithm for multi-UAV autonomous path planning

•This study develops a novel TD-MATD3 model to improve convergence efficiency.•A novel reward function is also designed to facilitate the convergence of the algorithm.•The TD-MATD3 can obtain superior performance in complex dynamic environments. Path planning is one of the most essential parts of ta...

Full description

Saved in:

Bibliographic Details
Published in:	Knowledge-based systems Vol. 287; p. 111462
Main Authors:	Zhou, Yatong, Kong, Xiaoran, Lin, Kuo-Ping, Liu, Liangyu
Format:	Journal Article
Language:	English
Published:	Elsevier B.V 05.03.2024
Subjects:	Decomposed Actor-Critic network DRL MATD3 Multiple UAVs Path planning Path planning MATD3 Decomposed Actor-Critic network Multiple UAVs DRL
ISSN:	0950-7051, 1872-7409
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Be the first to leave a comment!