Optimal dynamic output feedback control of unknown linear continuous-time systems by adaptive dynamic programming

In this paper, we present an approximate optimal dynamic output feedback control learning algorithm to solve the linear quadratic regulation problem for unknown linear continuous-time systems. First, a dynamic output feedback controller is designed by constructing the internal state. Then, an adapti...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Automatica (Oxford) Jg. 163; S. 111601
Hauptverfasser:	Xie, Kedi, Zheng, Yiwei, Jiang, Yi, Lan, Weiyao, Yu, Xiao
Format:	Journal Article
Sprache:	Englisch
Veröffentlicht:	Elsevier Ltd 01.05.2024
Schlagworte:	Adaptive dynamic programming Dynamic output feedback control Linear quadratic regulation Value iteration Dynamic output feedback control Value iteration Linear quadratic regulation Adaptive dynamic programming
ISSN:	0005-1098, 1873-2836
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In this paper, we present an approximate optimal dynamic output feedback control learning algorithm to solve the linear quadratic regulation problem for unknown linear continuous-time systems. First, a dynamic output feedback controller is designed by constructing the internal state. Then, an adaptive dynamic programming based learning algorithm is proposed to estimate the optimal feedback control gain by only accessing the input and output data. By adding a constructed virtual observer error into the iterative learning equation, the proposed learning algorithm with the new iterative learning equation is immune to the observer error. In addition, the value iteration based learning equation is established without storing a series of past data, which could lead to a reduction of demands on the usage of memory storage. Besides, the proposed algorithm eliminates the requirement of repeated finite window integrals, which may reduce the computational load. Moreover, the convergence analysis shows that the estimated control policy converges to the optimal control policy. Finally, a physical experiment on an unmanned quadrotor is given to illustrate the effectiveness of the proposed approach.
ISSN:	0005-1098 1873-2836
DOI:	10.1016/j.automatica.2024.111601