Optimal dynamic output feedback control of unknown linear continuous-time systems by adaptive dynamic programming

Bibliographic details
Published in:	Automatica (Oxford), Vol. 163, p. 111601
Main authors:	Xie, Kedi; Zheng, Yiwei; Jiang, Yi; Lan, Weiyao; Yu, Xiao
Format:	Journal Article
Language:	English
Published:	Elsevier Ltd, 1 May 2024
Subjects:	Adaptive dynamic programming; Dynamic output feedback control; Linear quadratic regulation; Value iteration
ISSN:	0005-1098, 1873-2836

Description
Summary:	In this paper, we present an approximate optimal dynamic output feedback control learning algorithm to solve the linear quadratic regulation problem for unknown linear continuous-time systems. First, a dynamic output feedback controller is designed by constructing the internal state. Then, an adaptive dynamic programming based learning algorithm is proposed to estimate the optimal feedback control gain using only input and output data. By adding a constructed virtual observer error to the iterative learning equation, the proposed learning algorithm is made immune to the observer error. In addition, the value iteration based learning equation is established without storing a series of past data, which could reduce memory requirements. The proposed algorithm also eliminates the need for repeated finite-window integrals, which may reduce the computational load. Moreover, the convergence analysis shows that the estimated control policy converges to the optimal control policy. Finally, a physical experiment on an unmanned quadrotor illustrates the effectiveness of the proposed approach.
DOI:	10.1016/j.automatica.2024.111601
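
Illustrative note: the summary refers to value iteration for the linear quadratic regulation problem. As a rough orientation only, the sketch below shows a model-based value iteration for the continuous-time algebraic Riccati equation, P <- P + step * (A'P + PA + Q - P B (1/r) B' P), started from P = 0. This is not the paper's algorithm: the paper learns from input and output data without knowing the system, whereas this sketch assumes A and B are known, the input is scalar, and a small constant step size is adequate. All function names, the step size, and the double-integrator example are assumptions made for illustration.

```python
def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)]
            for row in X]


def transpose(X):
    """Return the transpose of a matrix stored as a list of rows."""
    return [list(col) for col in zip(*X)]


def riccati_residual(P, A, B, Q, r):
    """Evaluate A'P + PA + Q - P B (1/r) B' P for a scalar input weight r."""
    AtP = matmul(transpose(A), P)
    PA = matmul(P, A)
    PB = matmul(P, B)                # n x 1 column
    BtP = matmul(transpose(B), P)    # 1 x n row
    PBBtP = matmul(PB, BtP)          # n x n
    n = len(P)
    return [[AtP[i][j] + PA[i][j] + Q[i][j] - PBBtP[i][j] / r
             for j in range(n)] for i in range(n)]


def value_iteration(A, B, Q, r, step=0.01, iters=5000):
    """Model-based value iteration (assumed constant step), starting at P = 0.

    Returns the value matrix P and the state feedback gain K = (1/r) B' P,
    so that the control law is u = -K x.
    """
    n = len(A)
    P = [[0.0] * n for _ in range(n)]
    for _ in range(iters):
        F = riccati_residual(P, A, B, Q, r)
        P = [[P[i][j] + step * F[i][j] for j in range(n)] for i in range(n)]
    K = [[v / r for v in row] for row in matmul(transpose(B), P)]
    return P, K


if __name__ == "__main__":
    # Hypothetical example: double integrator with Q = I and r = 1.
    A = [[0.0, 1.0], [0.0, 0.0]]
    B = [[0.0], [1.0]]
    Q = [[1.0, 0.0], [0.0, 1.0]]
    P, K = value_iteration(A, B, Q, 1.0)
    print("P =", P)
    print("K =", K)
```

For this double-integrator example the Riccati equation can be solved by hand, giving K = [1, sqrt(3)], which offers a simple check on the iteration. A data-driven method such as the one described in the summary would replace the explicit use of A and B with quantities estimated from measured input and output data.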