Bias-Policy Iteration-Based Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems

Bibliographic details
Published in: IEEE Transactions on Circuits and Systems I: Regular Papers, Vol. 72, No. 8, pp. 4284-4296
Main authors: Jiang, Huaiyuan; Li, Xiang; Zhou, Bin; Cao, Xibin
Format: Journal Article
Language: English
Published: New York: The Institute of Electrical and Electronics Engineers, Inc. (IEEE), 01.08.2025
ISSN:1549-8328, 1558-0806
Description
Summary: This paper presents bias-policy iteration, a modified adaptive dynamic programming method for the optimal control design of discrete-time nonlinear systems. First, the bias-policy iteration method is formulated and a thorough convergence analysis is provided. By leveraging the bias parameter, the admissible-control constraint is relaxed while the fast convergence of traditional policy iteration is retained. The method is then implemented in an actor-critic framework. Finally, the method is applied to the optimal control problem of an inverted pendulum system, and a simulation verifies the effectiveness of the bias-policy iteration approach.
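For background on the admissible-control constraint mentioned in the summary, the following is a minimal sketch of traditional policy iteration, not the paper's bias-policy iteration (its update rule and bias parameter are not given in this record). It assumes a scalar linear system x[k+1] = a*x[k] + b*u[k] with stage cost q*x^2 + r*u^2 and a linear policy u = -k*x; the system, the cost weights, and all function names are illustrative assumptions. Policy evaluation is only well defined when the initial gain stabilizes the closed loop, which is the constraint the paper reports relaxing.

```python
def evaluate_policy(a, b, q, r, k):
    """Policy evaluation: solve P = q + r*k^2 + (a - b*k)^2 * P for u = -k*x."""
    closed_loop = a - b * k
    if abs(closed_loop) >= 1.0:
        # The infinite-horizon cost diverges, so the policy is not admissible.
        raise ValueError("policy is not admissible: closed loop is unstable")
    return (q + r * k**2) / (1.0 - closed_loop**2)


def improve_policy(a, b, r, p):
    """Policy improvement: gain that minimizes r*u^2 + p*(a*x + b*u)^2 over u."""
    return a * b * p / (r + b**2 * p)


def policy_iteration(a, b, q, r, k0, iterations=20):
    """Alternate evaluation and improvement, starting from the gain k0."""
    k = k0
    history = []  # value coefficient P after each evaluation step
    for _ in range(iterations):
        p = evaluate_policy(a, b, q, r, k)
        k = improve_policy(a, b, r, p)
        history.append(p)
    return k, history


if __name__ == "__main__":
    # Open loop is unstable (a = 1.2), so k0 = 0 would be rejected.
    gain, values = policy_iteration(a=1.2, b=1.0, q=1.0, r=1.0, k0=1.0)
    print(gain, values[:4])
```

Starting from the stabilizing gain k0 = 1.0, the value coefficient decreases monotonically toward the solution of the scalar Riccati equation within a few iterations, while k0 = 0.0 raises an error because the open-loop system is unstable.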
DOI:10.1109/TCSI.2024.3492255