Data‐driven policy iteration algorithm for continuous‐time stochastic linear‐quadratic optimal control problems

This paper studies a continuous‐time stochastic linear‐quadratic (SLQ) optimal control problem on infinite‐horizon. Combining the Kronecker product theory with an existing policy iteration algorithm, a data‐driven policy iteration algorithm is proposed to solve the problem. In contrast to most exist...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	Asian journal of control Ročník 26; číslo 1; s. 481 - 489
Hlavní autoři:	Zhang, Heng, Li, Na
Médium:	Journal Article
Jazyk:	angličtina
Vydáno:	Hoboken Wiley Subscription Services, Inc 01.01.2024
Témata:	Data collection data‐driven Iterative algorithms Optimal control policy iteration Riccati equation stochastic algebraic Riccati equation stochastic linear‐quadratic optimal control problem Stochastic systems
ISSN:	1561-8625, 1934-6093
On-line přístup:	Získat plný text
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Popis
Shrnutí:	This paper studies a continuous‐time stochastic linear‐quadratic (SLQ) optimal control problem on infinite‐horizon. Combining the Kronecker product theory with an existing policy iteration algorithm, a data‐driven policy iteration algorithm is proposed to solve the problem. In contrast to most existing methods that need all information of system coefficients, the proposed algorithm eliminates the requirement of three system matrices by utilizing data of a stochastic system. More specifically, this algorithm uses the collected data to iteratively approximate the optimal control and a solution of the stochastic algebraic Riccati equation (SARE) corresponding to the SLQ optimal control problem. The convergence analysis of the obtained algorithm is given rigorously, and a simulation example is provided to illustrate the effectiveness and applicability of the algorithm.
Bibliografie:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	1561-8625 1934-6093
DOI:	10.1002/asjc.3223