Preferential Proximal Policy Optimization

The Proximal Policy Optimization (PPO) is a policy gradient approach providing state-of-the-art performance in many domains through the "surrogate" objective function using stochastic gradient ascent. While PPO is an appealing approach in reinforcement learning, it does not consider the im...

Full description

Saved in:

Bibliographic Details
Published in:	Proceedings (IEEE International Conference on Emerging Technologies and Factory Automation) pp. 293 - 300
Main Authors:	Balasuntharam, Tamilselvan, Davoudi, Heidar, Ebrahimi, Mehran
Format:	Conference Proceeding
Language:	English
Published:	IEEE 15.12.2023
Subjects:	Deep Learning Deep Reinforcement Learning Estimation Linear programming Machine Learning Machine learning algorithms Optimization Reinforcement learning Trajectory
ISSN:	1946-0759
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Be the first to leave a comment!