Data‐driven policy iteration algorithm for continuous‐time stochastic linear‐quadratic optimal control problems

This paper studies a continuous‐time stochastic linear‐quadratic (SLQ) optimal control problem on infinite‐horizon. Combining the Kronecker product theory with an existing policy iteration algorithm, a data‐driven policy iteration algorithm is proposed to solve the problem. In contrast to most exist...

Full description

Saved in:

Bibliographic Details
Published in:	Asian journal of control Vol. 26; no. 1; pp. 481 - 489
Main Authors:	Zhang, Heng, Li, Na
Format:	Journal Article
Language:	English
Published:	Hoboken Wiley Subscription Services, Inc 01.01.2024
Subjects:	Data collection data‐driven Iterative algorithms Optimal control policy iteration Riccati equation stochastic algebraic Riccati equation stochastic linear‐quadratic optimal control problem Stochastic systems
ISSN:	1561-8625, 1934-6093
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	This paper studies a continuous‐time stochastic linear‐quadratic (SLQ) optimal control problem on infinite‐horizon. Combining the Kronecker product theory with an existing policy iteration algorithm, a data‐driven policy iteration algorithm is proposed to solve the problem. In contrast to most existing methods that need all information of system coefficients, the proposed algorithm eliminates the requirement of three system matrices by utilizing data of a stochastic system. More specifically, this algorithm uses the collected data to iteratively approximate the optimal control and a solution of the stochastic algebraic Riccati equation (SARE) corresponding to the SLQ optimal control problem. The convergence analysis of the obtained algorithm is given rigorously, and a simulation example is provided to illustrate the effectiveness and applicability of the algorithm.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	1561-8625 1934-6093
DOI:	10.1002/asjc.3223