Reward-Guided Synthesis of Intelligent Agents with Control Structures

Deep reinforcement learning (RL) has led to encouraging successes in numerous challenging robotics applications. However, the lack of inductive biases to support logic deduction and generalization in the representation of a deep RL model causes it less effective in exploring complex long-horizon rob...

Full description

Saved in:

Bibliographic Details
Published in:	Proceedings of ACM on programming languages Vol. 8; no. PLDI; pp. 1730 - 1754
Main Authors:	Cui, Guofeng, Wang, Yuning, Qiu, Wenjie, Zhu, He
Format:	Journal Article
Language:	English
Published:	New York, NY, USA ACM 20.06.2024
Subjects:	Automatic programming Software and its engineering Sequential Decision Making Program Synthesis
ISSN:	2475-1421, 2475-1421
Online Access:	Get full text
Tags:	Add Tag No Tags, Be the first to tag this record!

Be the first to leave a comment!