Finite-Approximation-Error-Based Optimal Control Approach for Discrete-Time Nonlinear Systems

In this paper, a new iterative adaptive dynamic programming (ADP) algorithm is developed to solve optimal control problems for infinite-horizon discrete-time nonlinear systems with finite approximation errors. The idea is to use an iterative ADP algorithm to obtain the iterative control law that mak...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	IEEE transactions on cybernetics Ročník 43; číslo 2; s. 779 - 789
Hlavní autoři:	Liu, Derong, Wei, Qinglai
Médium:	Journal Article
Jazyk:	angličtina
Vydáno:	United States IEEE 01.04.2013 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Témata:	Adaptive dynamic programming (ADP) Algorithms approximate dynamic programming Approximation algorithms Approximation error Convergence Dynamic programming Dynamical systems finite approximation errors Heuristic algorithms Iterative algorithms Iterative methods Mathematical analysis Mathematical models Neural networks Nonlinear dynamics Nonlinear systems Optimal control Optimization Performance analysis Performance indices Studies
ISSN:	2168-2267, 2168-2275, 2168-2275
On-line přístup:	Získat plný text
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Popis
Shrnutí:	In this paper, a new iterative adaptive dynamic programming (ADP) algorithm is developed to solve optimal control problems for infinite-horizon discrete-time nonlinear systems with finite approximation errors. The idea is to use an iterative ADP algorithm to obtain the iterative control law that makes the iterative performance index function reach the optimum. When the iterative control law and the iterative performance index function in each iteration cannot be accurately obtained, the convergence conditions of the iterative ADP algorithm are obtained. When convergence conditions are satisfied, it is shown that the iterative performance index functions can converge to a finite neighborhood of the greatest lower bound of all performance index functions under some mild assumptions. Neural networks are used to approximate the performance index function and compute the optimal control policy, respectively, for facilitating the implementation of the iterative ADP algorithm. Finally, two simulation examples are given to illustrate the performance of the present method.
Bibliografie:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 ObjectType-Article-2 ObjectType-Feature-1 content type line 23
ISSN:	2168-2267 2168-2275 2168-2275
DOI:	10.1109/TSMCB.2012.2216523