Incremental value iteration for optimal output regulation of linear systems with unknown exosystems

This paper addresses the optimal output regulation problem for discrete-time linear systems with completely unknown dynamics and unmeasurable exosystem states. The primary objective is to design incremental dataset-based value iteration (VI) reinforcement learning algorithms to derive both state fee...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Neurocomputing (Amsterdam) Jg. 626; S. 129579
Hauptverfasser: Jing, Chonglin, Wang, Chaoli, Liang, Dong, Xu, Yujing, Hao, Longyan
Format: Journal Article
Sprache:Englisch
Veröffentlicht: Elsevier B.V 14.04.2025
Schlagworte:
ISSN:0925-2312
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:This paper addresses the optimal output regulation problem for discrete-time linear systems with completely unknown dynamics and unmeasurable exosystem states. The primary objective is to design incremental dataset-based value iteration (VI) reinforcement learning algorithms to derive both state feedback and output feedback controllers. In the context of data-driven optimal control, existing approaches typically require either the exosystem state to be measurable or the design of an autonomous system to reconstruct it. In contrast, this work proposes an incremental dataset-based VI algorithm, which eliminates the need for exosystem state measurement or reconstruction. Additionally, the proposed method allows for the selection of an arbitrary initial admissible control policy, thereby overcoming the challenge of requiring an initial admissible control in policy iteration algorithms. Furthermore, the system state is reconstructed using the incremental dataset, and an optimal output feedback controller is developed based on the proposed VI algorithm. The theoretical convergence of the dataset-based incremental VI algorithm is rigorously analyzed, and comprehensive simulations are conducted to validate its effectiveness.
ISSN:0925-2312
DOI:10.1016/j.neucom.2025.129579