Novel task decomposed multi-agent twin delayed deep deterministic policy gradient algorithm for multi-UAV autonomous path planning

• This study develops a novel TD-MATD3 model to improve convergence efficiency.
• A novel reward function is also designed to facilitate the convergence of the algorithm.
• The TD-MATD3 can obtain superior performance in complex dynamic environments.

Path planning is one of the most essential parts of ta...

Bibliographic Details
Published in: Knowledge-based systems Vol. 287; p. 111462
Main Authors: Zhou, Yatong, Kong, Xiaoran, Lin, Kuo-Ping, Liu, Liangyu
Format: Journal Article
Language: English
Published: Elsevier B.V. 05.03.2024
ISSN: 0950-7051, 1872-7409