Novel task decomposed multi-agent twin delayed deep deterministic policy gradient algorithm for multi-UAV autonomous path planning
•This study develops a novel TD-MATD3 model to improve convergence efficiency.•A novel reward function is also designed to facilitate the convergence of the algorithm.•The TD-MATD3 can obtain superior performance in complex dynamic environments. Path planning is one of the most essential parts of ta...
Saved in:
| Published in: | Knowledge-based systems Vol. 287; p. 111462 |
|---|---|
| Main Authors: | , , , |
| Format: | Journal Article |
| Language: | English |
| Published: |
Elsevier B.V
05.03.2024
|
| Subjects: | |
| ISSN: | 0950-7051, 1872-7409 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Be the first to leave a comment!