Real-Time Swing-up of a Linear Inverted Pendulum Using Reinforcement Learning

This study focused on applying and enhancing the Deep Deterministic Policy Gradient (DDPG) algorithm to effectively control a Single Inverted Pendulum (SIP) system. The primary objective was to improve the algorithm's performance by addressing common challenges such as overestimation of Q-value...

Full description

Saved in:
Bibliographic Details
Published in:Mechanika (Kaunas, Lithuania : 1995) Vol. 31; no. 2; pp. 123 - 135
Main Authors: BAJRAMI, Xhevahir, KAÇIU, Fisnik, SHALA, Erjon, LIKAJ, Rame
Format: Journal Article
Language:English
Published: Kaunas Kaunas University of Technology 01.03.2025
Kauno Technologijos Universitetas
Subjects:
ISSN:1392-1207, 2029-6983
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This study focused on applying and enhancing the Deep Deterministic Policy Gradient (DDPG) algorithm to effectively control a Single Inverted Pendulum (SIP) system. The primary objective was to improve the algorithm's performance by addressing common challenges such as overestimation of Q-values and convergence to local optima. The system's behaviour was analyzed through simulation and real-world experiments, showcasing the algorithm's ability to offer faster responses, enhanced stability, and reduced pendulum displacement. The research introduced key modifications to the experience replay mechanism and the Critic network, which played a significant role in improving the efficiency of the learning process and the robustness of the control strategy. By combining Reinforcement Learning with traditional control methods, this approach successfully managed the nonlinear dynamics of the SIP system. Nevertheless, certain challenges persist, particularly in terms of the efficiency of deep reinforcement learning algorithms and their stability in real-world environments. These findings suggest that future research should focus on further refining DRL algorithms to increase their practical application in physical control systems. In conclusion, the research highlights the potential of combining DRL techniques with conventional control strategies for tackling complex control problems. The success achieved in controlling the SIP system indicates a promising direction for further exploration and development in this field.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1392-1207
2029-6983
DOI:10.5755/j02.mech.39202