Individualizing Head-Related Transfer Functions for Binaural Acoustic Applications
A Head Related Transfer Function (HRTF) characterizes how a hu-man ear receives sounds from a point in space, and depends on the shapes of one's head, pinna, and torso. Accurate estimations of HRTFs for human subjects are crucial in enabling binaural acoustic applications such as sound localiza...
Saved in:
| Published in: | 2022 21st ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN) pp. 105 - 117 |
|---|---|
| Main Authors: | , , |
| Format: | Conference Proceeding |
| Language: | English |
| Published: |
IEEE
01.05.2022
|
| Subjects: | |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | A Head Related Transfer Function (HRTF) characterizes how a hu-man ear receives sounds from a point in space, and depends on the shapes of one's head, pinna, and torso. Accurate estimations of HRTFs for human subjects are crucial in enabling binaural acoustic applications such as sound localization and 3D sound spatialization. Unfortunately, conventional approaches for HRTF estimation rely on specialized devices or lengthy measurement processes. This work proposes a novel lightweight method for HRTF individual-ization that can be implemented using commercial-off-the-shelf components and performed by average users in home settings. The proposed method has two key components: a generative neural network model that can be individualized to predict HRTFs of new subjects from sparse measurements, and a lightweight measurement procedure that collects HRTF data from spatial locations. Exten-sive experiments using a public dataset and in house measurement data from 10 subjects of different ages and genders, show that the individualized models significantly outperform a baseline model in the accuracy of predicted HRTFs. To further demonstrate the advantages of individualized HRTFs, we implement two prototype applications for binaural localization and acoustic spatialization. We find that the performance of a localization model is improved by 15° after trained with individualized HRTFs. Furthermore, in hearing tests, the success rate of correctly identifying the azimuth direction of incoming sounds increases by 183% after individualization. |
|---|---|
| AbstractList | A Head Related Transfer Function (HRTF) characterizes how a hu-man ear receives sounds from a point in space, and depends on the shapes of one's head, pinna, and torso. Accurate estimations of HRTFs for human subjects are crucial in enabling binaural acoustic applications such as sound localization and 3D sound spatialization. Unfortunately, conventional approaches for HRTF estimation rely on specialized devices or lengthy measurement processes. This work proposes a novel lightweight method for HRTF individual-ization that can be implemented using commercial-off-the-shelf components and performed by average users in home settings. The proposed method has two key components: a generative neural network model that can be individualized to predict HRTFs of new subjects from sparse measurements, and a lightweight measurement procedure that collects HRTF data from spatial locations. Exten-sive experiments using a public dataset and in house measurement data from 10 subjects of different ages and genders, show that the individualized models significantly outperform a baseline model in the accuracy of predicted HRTFs. To further demonstrate the advantages of individualized HRTFs, we implement two prototype applications for binaural localization and acoustic spatialization. We find that the performance of a localization model is improved by 15° after trained with individualized HRTFs. Furthermore, in hearing tests, the success rate of correctly identifying the azimuth direction of incoming sounds increases by 183% after individualization. |
| Author | Zheng, Rong El-Mohandes, Awny M. Zandi, Navid H. |
| Author_xml | – sequence: 1 givenname: Navid H. surname: Zandi fullname: Zandi, Navid H. email: hossen6@mcmaster.ca organization: McMaster University,Hamilton,Canada – sequence: 2 givenname: Awny M. surname: El-Mohandes fullname: El-Mohandes, Awny M. email: awny.elmohandes@mans.edu.eg organization: Mansoura University,Mansoura,Egypt – sequence: 3 givenname: Rong surname: Zheng fullname: Zheng, Rong email: rzheng@mcmaster.ca organization: McMaster University,Hamilton,Canada |
| BookMark | eNotjEtOwzAUAI0ECyg9ASx8gZTnv70MFaWRKkClrCvHH2QpOJGTIMHpQYXVLGY0V-g89zkgdEtgRQiYu-bl9UlwxvSKAqUrACDyDC2N0kRKwY2kXF2ifZN9-kx-tl36Tvkdb4P11T50dgoeH4rNYwwFb-bsptTnEce-4PuU7Vxsh2vXz-OUHK6HoUvOnpJrdBFtN4blPxfobfNwWG-r3fNjs653VaLApoowrgOVTChvuHGgmNXgndYxtpx46o33Kuq2dZQLEYzjGlrlQAjBvZKCLdDN3zeFEI5DSR-2fB2NphJ-7Q_t1E4H |
| CODEN | IEEPAD |
| ContentType | Conference Proceeding |
| DBID | 6IE 6IL CBEJK RIE RIL |
| DOI | 10.1109/IPSN54338.2022.00016 |
| DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP All) 1998-Present |
| DatabaseTitleList | |
| Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://ieeexplore.ieee.org/ sourceTypes: Publisher |
| DeliveryMethod | fulltext_linktorsrc |
| EISBN | 9781665496247 166549624X |
| EndPage | 117 |
| ExternalDocumentID | 9826065 |
| Genre | orig-research |
| GroupedDBID | 6IE 6IL CBEJK RIE RIL |
| ID | FETCH-LOGICAL-i203t-1348e26357d949c073a80dc88ffb41d2d9dd7f8bbc2455e9c480b7c05554d7653 |
| IEDL.DBID | RIE |
| ISICitedReferencesCount | 5 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000855254100009&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| IngestDate | Thu Jun 29 18:36:42 EDT 2023 |
| IsPeerReviewed | false |
| IsScholarly | false |
| Language | English |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-i203t-1348e26357d949c073a80dc88ffb41d2d9dd7f8bbc2455e9c480b7c05554d7653 |
| PageCount | 13 |
| ParticipantIDs | ieee_primary_9826065 |
| PublicationCentury | 2000 |
| PublicationDate | 2022-May |
| PublicationDateYYYYMMDD | 2022-05-01 |
| PublicationDate_xml | – month: 05 year: 2022 text: 2022-May |
| PublicationDecade | 2020 |
| PublicationTitle | 2022 21st ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN) |
| PublicationTitleAbbrev | IPSN |
| PublicationYear | 2022 |
| Publisher | IEEE |
| Publisher_xml | – name: IEEE |
| Score | 1.8554622 |
| Snippet | A Head Related Transfer Function (HRTF) characterizes how a hu-man ear receives sounds from a point in space, and depends on the shapes of one's head, pinna,... |
| SourceID | ieee |
| SourceType | Publisher |
| StartPage | 105 |
| SubjectTerms | Acoustic applications Binaural Localization Conditional Variational AutoEncoder (CVAE) Data acquisition Estimation Head-Related Transfer Function (HRTF) Location awareness Neural networks Sound Spatialization Torso Transfer functions |
| Title | Individualizing Head-Related Transfer Functions for Binaural Acoustic Applications |
| URI | https://ieeexplore.ieee.org/document/9826065 |
| WOSCitedRecordID | wos000855254100009&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07a4MhFL0koUOntiSlbxw61sYYv6hjWhoSKCH0Rbbg4wrf8qXk0aG_vmpCmqFLNxFEuCrnqvecA3CbFESMC0gtN44m421qYiJKueEx2SicKLKLwsezHI_VdKonNbjbcWEQMRef4X1q5r98P3fr9FTW1jEXjpBZh7qUcsPV2rLhOky3R5PXcSHilSve-niW4Uwm5nueKRkyBkf_m-wYWr_cOzLZocoJ1LBqwstox5sqv2M3GcalobmQDT3JeBPHkkEEqbyPSExFyUNZmSSqQfpunj27SH_vt7oF74Ont8ch3boh0JKzbvKMFwqTdIz0WmgXj6ZRzDulQrCi47nX3sugrHVcFAVqJxSz0iVBL-Flr-ieQqOaV3gGhPUMR4NeKu1FMEH5nnPdYDmzDIPHc2imeMw-N4IXs20oLv7uvoTDFPBNFeAVNFaLNV7DgftalcvFTV6lH9eSlug |
| linkProvider | IEEE |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1NTwIxEJ0gmuhJDRi_7cGjK6V02faIRgIRN0TRcCP9mCZ7WQiCB3-9bSHIwYu3pknTZNrmTdt57wHcBgURZRwmmimTBOPtRPlENGGK-WQjNTyNLgofgyzPxXgshxW423BhEDEWn-F9aMa_fDs1y_BU1pA-F_aQuQO7KeesuWJrrflwTSob_eFbnnJ_6fL3PhaFOION-ZZrSgSN7uH_pjuC-i_7jgw3uHIMFSxr8NrfMKeKb99Nen5xkljKhpZExPFjSdfDVNxJxCej5KEoVZDVIB0zja5dpLP1X12H9-7T6LGXrP0QkoLRVnCN5wKDeExmJZfGH04lqDVCOKd50zIrrc2c0NownqYoDRdUZyZIenGbtdPWCVTLaYmnQGhbMVRoMyEtd8oJ2zam5TSjmqKzeAa1EI_JbCV5MVmH4vzv7hvY741eBpNBP3--gIMQ_FVN4CVUF_MlXsGe-VoUn_PruGI_3baaLw |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2022+21st+ACM%2FIEEE+International+Conference+on+Information+Processing+in+Sensor+Networks+%28IPSN%29&rft.atitle=Individualizing+Head-Related+Transfer+Functions+for+Binaural+Acoustic+Applications&rft.au=Zandi%2C+Navid+H.&rft.au=El-Mohandes%2C+Awny+M.&rft.au=Zheng%2C+Rong&rft.date=2022-05-01&rft.pub=IEEE&rft.spage=105&rft.epage=117&rft_id=info:doi/10.1109%2FIPSN54338.2022.00016&rft.externalDocID=9826065 |