Reward-Guided Synthesis of Intelligent Agents with Control Structures

Deep reinforcement learning (RL) has led to encouraging successes in numerous challenging robotics applications. However, the lack of inductive biases to support logic deduction and generalization in the representation of a deep RL model causes it less effective in exploring complex long-horizon rob...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	Proceedings of ACM on programming languages Ročník 8; číslo PLDI; s. 1730 - 1754
Hlavní autoři:	Cui, Guofeng, Wang, Yuning, Qiu, Wenjie, Zhu, He
Médium:	Journal Article
Jazyk:	angličtina
Vydáno:	New York, NY, USA ACM 20.06.2024
Témata:	Automatic programming Software and its engineering Sequential Decision Making Program Synthesis
ISSN:	2475-1421, 2475-1421
On-line přístup:	Získat plný text
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Buďte první, kdo okomentuje tento záznam!