Preferential Proximal Policy Optimization
Proximal Policy Optimization (PPO) is a policy gradient approach that achieves state-of-the-art performance in many domains by optimizing a "surrogate" objective function with stochastic gradient ascent. While PPO is an appealing approach in reinforcement learning, it does not consider the im...
| Published in: | Proceedings (IEEE International Conference on Emerging Technologies and Factory Automation), pp. 293-300 |
|---|---|
| Format: | Conference Proceeding |
| Language: | English |
| Published: | IEEE, 15.12.2023 |
| ISSN: | 1946-0759 |