Bias-Policy Iteration-Based Adaptive Dynamic Programming for Optimal Control of Discrete-Time Nonlinear Systems
This paper presents the bias-policy iteration, a modified adaptive dynamic programming method, to achieve optimal control design of discrete-time nonlinear systems. Firstly, the formulation of the bias-policy iteration method and the thorough convergence analysis are provided. By leveraging the bias...
Gespeichert in:
| Veröffentlicht in: | IEEE transactions on circuits and systems. I, Regular papers Jg. 72; H. 8; S. 4284 - 4296 |
|---|---|
| Hauptverfasser: | , , , |
| Format: | Journal Article |
| Sprache: | Englisch |
| Veröffentlicht: |
New York
IEEE
01.08.2025
The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
| Schlagworte: | |
| ISSN: | 1549-8328, 1558-0806 |
| Online-Zugang: | Volltext |
| Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
| Zusammenfassung: | This paper presents the bias-policy iteration, a modified adaptive dynamic programming method, to achieve optimal control design of discrete-time nonlinear systems. Firstly, the formulation of the bias-policy iteration method and the thorough convergence analysis are provided. By leveraging the bias parameter, the constraint of admissible control is relaxed while the fast convergence of traditional policy iteration is inherited. The actor-critic framework is utilized to realize the implementation of the proposed method accordingly. Finally, the proposed method is applied to optimal control problem of the inverted pendulum system. The simulation is conducted to verify the effectiveness of the bias-policy iteration approach. |
|---|---|
| Bibliographie: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
| ISSN: | 1549-8328 1558-0806 |
| DOI: | 10.1109/TCSI.2024.3492255 |