Reward-Guided Synthesis of Intelligent Agents with Control Structures

Deep reinforcement learning (RL) has led to encouraging successes in numerous challenging robotics applications. However, the lack of inductive biases to support logic deduction and generalization in the representation of a deep RL model makes it less effective at exploring complex long-horizon rob...

Bibliographic Details
Published in:	Proceedings of the ACM on Programming Languages, Vol. 8, Issue PLDI, pp. 1730-1754
Main Authors:	Cui, Guofeng; Wang, Yuning; Qiu, Wenjie; Zhu, He
Format:	Journal Article
Language:	English
Published:	New York, NY, USA: ACM, 20 June 2024
Subjects:	Automatic programming; Software and its engineering; Sequential Decision Making; Program Synthesis
ISSN:	2475-1421
Online Access:	Full text