Infinite time linear quadratic stackelberg game problem for unknown stochastic discrete‐time systems via adaptive dynamic programming approach

In this paper, we propose an adaptive dynamic programming (ADP) approach to solve the infinite horizon linear quadratic (LQ) Stackelberg game problem for unknown stochastic discrete‐time systems with multiple decision makers. Firstly, the stochastic LQ Stackelberg game problem is converted into the...

Full description

Saved in:
Bibliographic Details
Published in:Asian journal of control Vol. 23; no. 2; pp. 937 - 948
Main Authors: Liu, Xikui, Liu, Ruirui, Li, Yan
Format: Journal Article
Language:English
Published: Hoboken Wiley Subscription Services, Inc 01.03.2021
Subjects:
ISSN:1561-8625, 1934-6093
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In this paper, we propose an adaptive dynamic programming (ADP) approach to solve the infinite horizon linear quadratic (LQ) Stackelberg game problem for unknown stochastic discrete‐time systems with multiple decision makers. Firstly, the stochastic LQ Stackelberg game problem is converted into the deterministic problem by system transformation. Next, a value iteration ADP approach is put forword and the convergence is given. Thirdly, in order to implement the iterative method, back propagation neural network (BPNN) is chosen to design model network, critic network and action network to approximate the unknown systems, objective functions and Stackelberg strategies. Finally, simulation results show that the algorithm is effective.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1561-8625
1934-6093
DOI:10.1002/asjc.2276