Learning Robust Control Policies for End-to-End Autonomous Driving From Data-Driven Simulation

In this work, we present a data-driven simulation and training engine capable of learning end-to-end autonomous vehicle control policies using only sparse rewards. By leveraging real, human-collected trajectories through an environment, we render novel training data that allows virtual agents to dri...

Celý popis

Uloženo v:

Podrobná bibliografie
Vydáno v:	IEEE robotics and automation letters Ročník 5; číslo 2; s. 1142 - 1149
Hlavní autoři:	Amini, Alexander, Gilitschenski, Igor, Phillips, Jacob, Moseyko, Julia, Banerjee, Rohan, Karaman, Sertac, Rus, Daniela
Médium:	Journal Article
Jazyk:	angličtina
Vydáno:	Piscataway IEEE 01.04.2020 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Témata:	Access control autonomous agents Autonomous vehicles data-driven simulation Deep learning in robotics and automation Machine learning Policies real world reinforcement learning Roads & highways Robust control Semantics Simulation Training
ISSN:	2377-3766, 2377-3766
On-line přístup:	Získat plný text
Tagy:	Přidat tag Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!

Popis
Shrnutí:	In this work, we present a data-driven simulation and training engine capable of learning end-to-end autonomous vehicle control policies using only sparse rewards. By leveraging real, human-collected trajectories through an environment, we render novel training data that allows virtual agents to drive along a continuum of new local trajectories consistent with the road appearance and semantics, each with a different view of the scene. We demonstrate the ability of policies learned within our simulator to generalize to and navigate in previously unseen real-world roads, without access to any human control labels during training. Our results validate the learned policy onboard a full-scale autonomous vehicle, including in previously un-encountered scenarios, such as new roads and novel, complex, near-crash situations. Our methods are scalable, leverage reinforcement learning, and apply broadly to situations requiring effective perception and robust operation in the physical world.
Bibliografie:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	2377-3766 2377-3766
DOI:	10.1109/LRA.2020.2966414