Real-Time Swing-up of a Linear Inverted Pendulum Using Reinforcement Learning

This study focused on applying and enhancing the Deep Deterministic Policy Gradient (DDPG) algorithm to effectively control a Single Inverted Pendulum (SIP) system. The primary objective was to improve the algorithm's performance by addressing common challenges such as overestimation of Q-value...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	Mechanika (Kaunas, Lithuania : 1995) Ročník 31; číslo 2; s. 123 - 135
Hlavní autoři:	BAJRAMI, Xhevahir, KAÇIU, Fisnik, SHALA, Erjon, LIKAJ, Rame
Médium:	Journal Article
Jazyk:	angličtina
Vydáno:	Kaunas Kaunas University of Technology 01.03.2025 Kauno Technologijos Universitetas
Témata:	Algorithms Computational linguistics Control engineering Control methods Deep learning Dynamical systems Language processing Machine learning Natural language interfaces Nonlinear dynamics Pendulums Robust control Stability single inverted pen deep deterministic policy gradient deep learning dynamical systems control systems reinforcement learning
ISSN:	1392-1207, 2029-6983
On-line přístup:	Získat plný text
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Popis
Shrnutí:	This study focused on applying and enhancing the Deep Deterministic Policy Gradient (DDPG) algorithm to effectively control a Single Inverted Pendulum (SIP) system. The primary objective was to improve the algorithm's performance by addressing common challenges such as overestimation of Q-values and convergence to local optima. The system's behaviour was analyzed through simulation and real-world experiments, showcasing the algorithm's ability to offer faster responses, enhanced stability, and reduced pendulum displacement. The research introduced key modifications to the experience replay mechanism and the Critic network, which played a significant role in improving the efficiency of the learning process and the robustness of the control strategy. By combining Reinforcement Learning with traditional control methods, this approach successfully managed the nonlinear dynamics of the SIP system. Nevertheless, certain challenges persist, particularly in terms of the efficiency of deep reinforcement learning algorithms and their stability in real-world environments. These findings suggest that future research should focus on further refining DRL algorithms to increase their practical application in physical control systems. In conclusion, the research highlights the potential of combining DRL techniques with conventional control strategies for tackling complex control problems. The success achieved in controlling the SIP system indicates a promising direction for further exploration and development in this field.
Bibliografie:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	1392-1207 2029-6983
DOI:	10.5755/j02.mech.39202