OVITA: Open-Vocabulary Interpretable Trajectory Adaptations
| Published in: | IEEE Robotics and Automation Letters, Volume 10; Issue 11; pp. 11054-11061 |
|---|---|
| Main authors: | Maurya, Anurag; Ghosh, Tashmoy; Nguyen, Anh; Prakash, Ravi |
| Format: | Journal Article |
| Language: | English |
| Publication details: | Piscataway: IEEE (The Institute of Electrical and Electronics Engineers, Inc.), 1 November 2025 |
| ISSN: | 2377-3766 |
| Abstract | Adapting trajectories to dynamic situations and user preferences is crucial for robot operation in unstructured environments with non-expert users. Natural language enables users to express these adjustments in an interactive manner. We introduce OVITA, an interpretable, open-vocabulary, language-driven framework designed for adapting robot trajectories in dynamic and novel situations based on human instructions. OVITA leverages multiple pre-trained Large Language Models (LLMs) to integrate user commands into trajectories generated by motion planners or those learned through demonstrations. OVITA employs code as an adaptation policy generated by an LLM, enabling users to adjust individual waypoints, thus providing flexible control. Another LLM, which acts as a code explainer, removes the need for expert users, enabling intuitive interactions. The efficacy and significance of the proposed OVITA framework is demonstrated through extensive simulations and real-world environments with diverse tasks involving spatiotemporal variations on heterogeneous robotic platforms such as a KUKA IIWA robot manipulator, Clearpath Jackal ground robot, and CrazyFlie drone. |
|---|---|
| Author | Ghosh, Tashmoy; Prakash, Ravi; Nguyen, Anh; Maurya, Anurag |
| Author_xml | 1. Maurya, Anurag (ORCID: 0009-0003-2651-1757; email: anuragm1@iisc.ac.in; Human-interactive Robotics (HiRo) Lab, Cyber-Physical Systems, Indian Institute of Science, Bangalore, India). 2. Ghosh, Tashmoy (ORCID: 0009-0001-1662-9378; email: tashmoyg@iisc.ac.in; Human-interactive Robotics (HiRo) Lab, Cyber-Physical Systems, Indian Institute of Science, Bangalore, India). 3. Nguyen, Anh (ORCID: 0000-0002-1449-211X; email: anh.nguyen@liverpool.ac.uk; Department of Computer Science, University of Liverpool, Liverpool, U.K.). 4. Prakash, Ravi (ORCID: 0000-0002-9058-434X; email: ravipr@iisc.ac.in; Human-interactive Robotics (HiRo) Lab, Cyber-Physical Systems, Indian Institute of Science, Bangalore, India). |
| CODEN | IRALC6 |
| ContentType | Journal Article |
| Copyright | Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2025 |
| DOI | 10.1109/LRA.2025.3606309 |
| Discipline | Engineering |
| EISSN | 2377-3766 |
| EndPage | 11061 |
| Genre | orig-research |
| GrantInformation | Funder: Kotak IISc AI/ML centre and in part by ARTPARK, IISc Bangalore |
| ISSN | 2377-3766 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 11 |
| Language | English |
| License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html https://doi.org/10.15223/policy-029 https://doi.org/10.15223/policy-037 |
| ORCID | 0009-0003-2651-1757 0000-0002-1449-211X 0009-0001-1662-9378 0000-0002-9058-434X |
| PageCount | 8 |
| PublicationDate | 2025-11-01 |
| PublicationPlace | Piscataway |
| PublicationTitle | IEEE robotics and automation letters |
| PublicationTitleAbbrev | LRA |
| PublicationYear | 2025 |
| Publisher | IEEE; The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
| StartPage | 11054 |
| SubjectTerms | big data in robotics and automation; Codes; Dynamics; Grounding; human-robot collaboration; Large language models; Motion and path planning; Motion planning; Planning; Robot arms; Robot sensing systems; Robots; Service robots; Training; Trajectories; Trajectory; Translation |
| Title | OVITA: Open-Vocabulary Interpretable Trajectory Adaptations |
| URI | https://ieeexplore.ieee.org/document/11150730 https://www.proquest.com/docview/3251523133 |
| Volume | 10 |
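
The abstract describes code acting as an adaptation policy that adjusts individual waypoints of an existing trajectory. The following is only an illustrative sketch of what such a policy could look like for an instruction like "slow down near the object"; it is not taken from the paper. The `Waypoint` type, the `slow_down_near` function, and the `radius` and `factor` parameters are all hypothetical names chosen for this example.

```python
from dataclasses import dataclass, replace
import math


@dataclass(frozen=True)
class Waypoint:
    # Hypothetical waypoint layout: Cartesian position plus a speed value.
    x: float
    y: float
    z: float
    speed: float


def slow_down_near(trajectory, obj_xyz, radius=0.5, factor=0.5):
    """Return a copy of the trajectory with speed scaled near an object.

    Assumed policy for illustration: every waypoint whose Euclidean
    distance to obj_xyz is at most `radius` has its speed multiplied
    by `factor`; positions are left unchanged.
    """
    adapted = []
    for wp in trajectory:
        dist = math.dist((wp.x, wp.y, wp.z), obj_xyz)
        if dist <= radius:
            adapted.append(replace(wp, speed=wp.speed * factor))
        else:
            adapted.append(wp)
    return adapted


# A straight-line trajectory along x at constant speed.
trajectory = [Waypoint(0.1 * i, 0.0, 0.5, 1.0) for i in range(11)]

# Apply the sketched policy around a hypothetical object at x = 0.5.
adapted = slow_down_near(trajectory, obj_xyz=(0.5, 0.0, 0.5), radius=0.25)
```

Because the policy is ordinary code operating on a list of waypoints, each change it makes can be inspected or explained line by line, which is the kind of interpretability the abstract attributes to the code-explainer step.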