Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems With Trajectory-Based Initial Control Policy

The policy gradient adaptive dynamic programming (PGADP) technique has gained recognition as an effective approach for optimizing the performance of nonlinear systems. Nonetheless, existing PGADP algorithms often demand a substantial volume of expensive or potentially risky interaction data with the...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on systems, man, and cybernetics. Systems Jg. 54; H. 3; S. 1489 - 1501
Hauptverfasser:	Xu, Jiahui, Wang, Jingcheng, Rao, Jun, Wu, Shunyu, Zhong, Yanjiu
Format:	Journal Article
Sprache:	Englisch
Veröffentlicht:	New York IEEE 01.03.2024 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Schlagworte:	Adaptive control Adaptive dynamic programming (ADP) Algorithms Closed loops Controllers Discrete time systems Dynamic programming Feedback control Heuristic algorithms initial control policy Neural networks Nonlinear control Nonlinear systems Optimal control Optimization OptNet Predictive control System dynamics System effectiveness Training Trajectory Trajectory control
ISSN:	2168-2216, 2168-2232
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Schreiben Sie den ersten Kommentar!