Evaluating the Impact of Personalized Value Alignment in Human-Robot Interaction: Insights into Trust and Team Performance Outcomes
This paper examines the effect of real-time, personalized alignment of a robot's reward function to the human's values on trust and team performance. We present and compare three distinct robot interaction strategies: a non-learner strategy where the robot presumes the human's reward...
Saved in:
| Published in: | 2024 19th ACM/IEEE International Conference on Human-Robot Interaction (HRI) pp. 32 - 41 |
|---|---|
| Main Authors: | , , , |
| Format: | Conference Proceeding |
| Language: | English |
| Published: |
ACM
11.03.2024
|
| Subjects: | |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | This paper examines the effect of real-time, personalized alignment of a robot's reward function to the human's values on trust and team performance. We present and compare three distinct robot interaction strategies: a non-learner strategy where the robot presumes the human's reward function mirrors its own; a non-adaptive-learner strategy in which the robot learns the human's reward function for trust estimation and human behavior modeling, but still optimizes its own reward function; and an adaptive-learner strategy in which the robot learns the human's reward function and adopts it as its own. Two human-subject experiments with a total number of N = 54 participants were conducted. In both experiments, the human-robot team searches for potential threats in a town. The team sequentially goes through search sites to look for threats. We model the interaction between the human and the robot as a trust-aware Markov Decision Process (trust-aware MDP) and use Bayesian Inverse Reinforcement Learning (IRL) to estimate the reward weights of the human as they interact with the robot. In Experiment 1, we start our learning algorithm with an informed prior of the human's values/goals. In Experiment 2, we start the learning algorithm with an uninformed prior. Results indicate that when starting with a good informed prior, personalized value alignment does not seem to benefit trust or team performance. On the other hand, when an informed prior is unavailable, alignment to the human's values leads to high trust and higher perceived performance while maintaining the same objective team performance.CCS CONCEPTS* Human-centered computing → Empirical studies in HCI; * Computer systems organization → Robotic autonomy. |
|---|---|
| AbstractList | This paper examines the effect of real-time, personalized alignment of a robot's reward function to the human's values on trust and team performance. We present and compare three distinct robot interaction strategies: a non-learner strategy where the robot presumes the human's reward function mirrors its own; a non-adaptive-learner strategy in which the robot learns the human's reward function for trust estimation and human behavior modeling, but still optimizes its own reward function; and an adaptive-learner strategy in which the robot learns the human's reward function and adopts it as its own. Two human-subject experiments with a total number of N = 54 participants were conducted. In both experiments, the human-robot team searches for potential threats in a town. The team sequentially goes through search sites to look for threats. We model the interaction between the human and the robot as a trust-aware Markov Decision Process (trust-aware MDP) and use Bayesian Inverse Reinforcement Learning (IRL) to estimate the reward weights of the human as they interact with the robot. In Experiment 1, we start our learning algorithm with an informed prior of the human's values/goals. In Experiment 2, we start the learning algorithm with an uninformed prior. Results indicate that when starting with a good informed prior, personalized value alignment does not seem to benefit trust or team performance. On the other hand, when an informed prior is unavailable, alignment to the human's values leads to high trust and higher perceived performance while maintaining the same objective team performance.CCS CONCEPTS* Human-centered computing → Empirical studies in HCI; * Computer systems organization → Robotic autonomy. |
| Author | Yang, X. Jessie Bhat, Shreyas Lyons, Joseph B. Shi, Cong |
| Author_xml | – sequence: 1 givenname: Shreyas surname: Bhat fullname: Bhat, Shreyas email: shreyasb@umich.edu organization: University of Michigan,Ann Arbor,Michigan,USA – sequence: 2 givenname: Joseph B. surname: Lyons fullname: Lyons, Joseph B. email: joseph.lyons.6@us.af.mil organization: Air Force Research Laboratory,Dayton,Ohio,USA – sequence: 3 givenname: Cong surname: Shi fullname: Shi, Cong email: congshi@bus.miami.edu organization: Miami Herbert Business School,Miami,Florida,USA – sequence: 4 givenname: X. Jessie surname: Yang fullname: Yang, X. Jessie email: xijyang@umich.edu organization: University of Michigan,Ann Arbor,Michigan,USA |
| BookMark | eNotkE1LAzEURSMoqLVrNy7yB0aTybe7UqotFCpS3ZbXmTdtYCYpk4ygW_-4I7q6XLjnLO41OQ8xICG3nN1zLtWD0Jw5Y-6FFtKV_IxMnXFWMmaYKEt1SaYp-T1T0nEmlb0i34sPaAfIPhxoPiJddSeoMo0NfcE-xQCt_8Kavo8jpLPWH0KHIVMf6HLoIBSvcR8zXYWM_cj5GB7HkvzhmNM4ypFu-yFlCqGmW4Tu19rEfiQrpJshV7HDdEMuGmgTTv9zQt6eFtv5slhvnlfz2bqAUtpcgLKGGVuXvJKWOw1l4yQIAw0yDgLAaKOxUhIqY5ysFRjhtGqsrJkRtRQTcvfn9Yi4O_W-g_5zx5keTxNO_ADcEGHL |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1145/3610977.3634921 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Xplore Digital Libary (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| EISBN | 9798400703225 |
| EndPage | 41 |
| ExternalDocumentID | 10661039 |
| Genre | orig-research |
| GrantInformation_xml | – fundername: Air Force Office of Scientific Research funderid: 10.13039/100000181 |
| GroupedDBID | 6IE 6IL ACM ALMA_UNASSIGNED_HOLDINGS APO CBEJK LHSKQ RIE RIL |
| ID | FETCH-LOGICAL-a248t-a587078d21c48196a2f94a37afe01a3aa7676ec54ac7794d5a73965f84d073d43 |
| IEDL.DBID | RIE |
| ISICitedReferencesCount | 9 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=001239977500007&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| IngestDate | Tue May 06 03:31:43 EDT 2025 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-a248t-a587078d21c48196a2f94a37afe01a3aa7676ec54ac7794d5a73965f84d073d43 |
| PageCount | 10 |
| ParticipantIDs | ieee_primary_10661039 |
| PublicationCentury | 2000 |
| PublicationDate | 2024-March-11 |
| PublicationDateYYYYMMDD | 2024-03-11 |
| PublicationDate_xml | – month: 03 year: 2024 text: 2024-March-11 day: 11 |
| PublicationDecade | 2020 |
| PublicationTitle | 2024 19th ACM/IEEE International Conference on Human-Robot Interaction (HRI) |
| PublicationTitleAbbrev | HRI |
| PublicationYear | 2024 |
| Publisher | ACM |
| Publisher_xml | – name: ACM |
| SSID | ssib054910458 |
| Score | 2.0092578 |
| Snippet | This paper examines the effect of real-time, personalized alignment of a robot's reward function to the human's values on trust and team performance. We... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 32 |
| SubjectTerms | Estimation Human computer interaction Human-robot interaction Human-robot teaming Markov decision processes Organizations Real-time systems Reinforcement learning trust-aware decision-making value-alignment |
| Title | Evaluating the Impact of Personalized Value Alignment in Human-Robot Interaction: Insights into Trust and Team Performance Outcomes |
| URI | https://ieeexplore.ieee.org/document/10661039 |
| WOSCitedRecordID | wos001239977500007&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3PS8MwFA46PHhSceJvcvDauTRpknoT2VCQOWTKbuMteRkDbWXtPHj1HzfJVsWDB09tQ0ggP_q-916-L4RccKVzdFNMDAObCOV9Vs3AJc5JaYQH9DaLROF7NRjo8TgfrsnqkQuDiPHwGXbCa8zl29IsQ6jM73BvTbo83ySbSqkVWatZPN7PYSHpt5bvYSK75EFKXKkOl0GCj_26PyWaj_7OPzveJe0fIh4dfpuYPbKBxT757K0VuosZ9fCN3kWiIy0dHTbI-gMtffaVkF6_zGcx4U_nBY0R--SxnJY1jaHAFavhyn9UwUmvfKW6pKPAw6BQWDpCeA2tNtwC-rCs_RLFqk2e-r3RzW2yvkshgVToOoFMB10fmzIjPAiQkLpcAFfgsMuAAyipJJpMgFF-i9oMFM9l5rSw_idgBT8graIs8JBQDilqjwMFKBBTDVp2tUNt_DMPedYj0g4jOHlbyWVMmsE7_qP8hGynHimEg12MnZJWvVjiGdky7_W8WpzHSf4CpbSqMw |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV09T8MwELWgIMEEiCK-8cCaUsdO7LAh1KoVpVQooG7VNT5XlSBBbcrAyh_HdlMQAwNTEsuyJX_k3t35PRNyyaVK0IwxyBjoQEjrsyoGJjAmjjNhAb2OPFG4J_t9NRwmg4qs7rkwiOgPn2HDvfpcvi6yhQuV2R1urUmTJ-tkIxIiZEu61mr5WE-HubRfJeDDRHTFnZi4lA0eOxE-9usGFW9A2jv_7HqX1H-oeHTwbWT2yBrm--SzVWl05xNqARzteqojLQwdrLD1B2r6bCshvXmZTnzKn05z6mP2wWMxLkrqg4FLXsO1_Zg7N31uK5UFTR0Tg0KuaYrw6lpdsQvow6K0ixTndfLUbqW3naC6TSGAUKgygEg5ZR8dskxYGBBDaBIBXILBJgMOIGMZYxYJyKTdpDoCyZM4Mkpo-xvQgh-QWl7keEgohxCVRYICJIixAhU3lUGV2WfiMq1HpO5GcPS2FMwYrQbv-I_yC7LVSe97o163f3dCtkOLG9wxL8ZOSa2cLfCMbGbv5XQ-O_cT_gWsNq16 |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2024+19th+ACM%2FIEEE+International+Conference+on+Human-Robot+Interaction+%28HRI%29&rft.atitle=Evaluating+the+Impact+of+Personalized+Value+Alignment+in+Human-Robot+Interaction%3A+Insights+into+Trust+and+Team+Performance+Outcomes&rft.au=Bhat%2C+Shreyas&rft.au=Lyons%2C+Joseph+B.&rft.au=Shi%2C+Cong&rft.au=Yang%2C+X.+Jessie&rft.date=2024-03-11&rft.pub=ACM&rft.spage=32&rft.epage=41&rft_id=info:doi/10.1145%2F3610977.3634921&rft.externalDocID=10661039 |