Optimal control of a two‐wheeled self‐balancing robot by reinforcement learning

Bibliographic details
Published in:	International journal of robust and nonlinear control, Volume 31, Issue 6, pp. 1885-1904
Main authors:	Guo, Linyuan; Rizvi, Syed Ali Asad; Lin, Zongli
Format:	Journal Article
Language:	English
Published:	Bognor Regis: Wiley Subscription Services, Inc, 1 April 2021
Subjects:	Algorithms; Balancing; Control methods; Control systems; Decoupling; Machine learning; Mathematical models; Optimal control; Output feedback; Parameter uncertainty; Q‐learning; reinforcement learning; Riccati equation; Robot control; robustness; State feedback; two‐wheeled self‐balancing robot; Yaw
ISSN:	1049-8923, 1099-1239

Description
Summary:	This article concerns optimal control of the linear motion, tilt motion, and yaw motion of a two‐wheeled self‐balancing robot (TWSBR). Traditional optimal control methods for the TWSBR usually require a precise model of the system, and other control methods exist that achieve stabilization in the face of parameter uncertainties. In practical applications, it is often desirable to realize optimal control without precise knowledge of the system parameters. This article proposes to use a new feedback‐based reinforcement learning method to solve the linear quadratic regulation (LQR) control problem for the TWSBR. The proposed control scheme is completely online and does not require any knowledge of the system parameters. The proposed input decoupling mechanism and pre‐feedback law overcome the computational difficulties commonly encountered in implementing the learning algorithms. Both state feedback optimal control and output feedback optimal control are presented. Numerical simulation shows that the proposed optimal control scheme is capable of stabilizing the system and converging to the LQR solution obtained by solving the algebraic Riccati equation.
DOI:	10.1002/rnc.5058