ImageInThat: Manipulating Images to Convey User Instructions to Robots

Foundation models are rapidly improving the capability of robots in performing everyday tasks autonomously such as meal preparation, yet robots will still need to be instructed by humans due to model performance, the difficulty of capturing user preferences, and the need for user agency. Robots can...

Bibliographic details
Published in:	2025 20th ACM/IEEE International Conference on Human-Robot Interaction (HRI), pp. 757-766
Main authors:	Mahadevan, Karthik; Lewis, Blaine; Li, Jiannan; Mutlu, Bilge; Tang, Anthony; Grossman, Tovi
Format:	Conference paper
Language:	English
Published:	IEEE, 4 March 2025
Subjects:	Codes; direct manipulation; end-user robot programming; Faces; Foundation models; Human-robot interaction; Natural languages; Prototypes; robot instruction following; Robot programming; Robots