Optimal control of a two‐wheeled self‐balancing robot by reinforcement learning

Summary This article concerns optimal control of the linear motion, tilt motion, and yaw motion of a two‐wheeled self‐balancing robot (TWSBR). Traditional optimal control methods for the TWSBR usually require a precise model of the system, and other control methods exist that achieve stabilization i...

Full description

Saved in:
Bibliographic Details
Published in:International journal of robust and nonlinear control Vol. 31; no. 6; pp. 1885 - 1904
Main Authors: Guo, Linyuan, Rizvi, Syed Ali Asad, Lin, Zongli
Format: Journal Article
Language:English
Published: Bognor Regis Wiley Subscription Services, Inc 01.04.2021
Subjects:
ISSN:1049-8923, 1099-1239
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Summary This article concerns optimal control of the linear motion, tilt motion, and yaw motion of a two‐wheeled self‐balancing robot (TWSBR). Traditional optimal control methods for the TWSBR usually require a precise model of the system, and other control methods exist that achieve stabilization in the face of parameter uncertainties. In practical applications, it is often desirable to realize optimal control in the absence of the precise knowledge of the system parameters. This article proposes to use a new feedback‐based reinforcement learning method to solve the linear quadratic regulation (LQR) control problem for the TWSBR. The proposed control scheme is completely online and does not require any knowledge of the system parameters. The proposed input decoupling mechanism and pre‐feedback law overcome the commonly encountered computational difficulties in implementing the learning algorithms. Both state feedback optimal control and output feedback optimal control are presented. Numerical simulation shows that the proposed optimal control scheme is capable of stabilizing the system and converging to the LQR solution obtained through solving the algebraic Riccati equation.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1049-8923
1099-1239
DOI:10.1002/rnc.5058