Individualizing Head-Related Transfer Functions for Binaural Acoustic Applications

A Head Related Transfer Function (HRTF) characterizes how a hu-man ear receives sounds from a point in space, and depends on the shapes of one's head, pinna, and torso. Accurate estimations of HRTFs for human subjects are crucial in enabling binaural acoustic applications such as sound localiza...

Full description

Saved in:
Bibliographic Details
Published in:2022 21st ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN) pp. 105 - 117
Main Authors: Zandi, Navid H., El-Mohandes, Awny M., Zheng, Rong
Format: Conference Proceeding
Language:English
Published: IEEE 01.05.2022
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Abstract A Head Related Transfer Function (HRTF) characterizes how a hu-man ear receives sounds from a point in space, and depends on the shapes of one's head, pinna, and torso. Accurate estimations of HRTFs for human subjects are crucial in enabling binaural acoustic applications such as sound localization and 3D sound spatialization. Unfortunately, conventional approaches for HRTF estimation rely on specialized devices or lengthy measurement processes. This work proposes a novel lightweight method for HRTF individual-ization that can be implemented using commercial-off-the-shelf components and performed by average users in home settings. The proposed method has two key components: a generative neural network model that can be individualized to predict HRTFs of new subjects from sparse measurements, and a lightweight measurement procedure that collects HRTF data from spatial locations. Exten-sive experiments using a public dataset and in house measurement data from 10 subjects of different ages and genders, show that the individualized models significantly outperform a baseline model in the accuracy of predicted HRTFs. To further demonstrate the advantages of individualized HRTFs, we implement two prototype applications for binaural localization and acoustic spatialization. We find that the performance of a localization model is improved by 15° after trained with individualized HRTFs. Furthermore, in hearing tests, the success rate of correctly identifying the azimuth direction of incoming sounds increases by 183% after individualization.
AbstractList A Head Related Transfer Function (HRTF) characterizes how a hu-man ear receives sounds from a point in space, and depends on the shapes of one's head, pinna, and torso. Accurate estimations of HRTFs for human subjects are crucial in enabling binaural acoustic applications such as sound localization and 3D sound spatialization. Unfortunately, conventional approaches for HRTF estimation rely on specialized devices or lengthy measurement processes. This work proposes a novel lightweight method for HRTF individual-ization that can be implemented using commercial-off-the-shelf components and performed by average users in home settings. The proposed method has two key components: a generative neural network model that can be individualized to predict HRTFs of new subjects from sparse measurements, and a lightweight measurement procedure that collects HRTF data from spatial locations. Exten-sive experiments using a public dataset and in house measurement data from 10 subjects of different ages and genders, show that the individualized models significantly outperform a baseline model in the accuracy of predicted HRTFs. To further demonstrate the advantages of individualized HRTFs, we implement two prototype applications for binaural localization and acoustic spatialization. We find that the performance of a localization model is improved by 15° after trained with individualized HRTFs. Furthermore, in hearing tests, the success rate of correctly identifying the azimuth direction of incoming sounds increases by 183% after individualization.
Author Zheng, Rong
El-Mohandes, Awny M.
Zandi, Navid H.
Author_xml – sequence: 1
  givenname: Navid H.
  surname: Zandi
  fullname: Zandi, Navid H.
  email: hossen6@mcmaster.ca
  organization: McMaster University,Hamilton,Canada
– sequence: 2
  givenname: Awny M.
  surname: El-Mohandes
  fullname: El-Mohandes, Awny M.
  email: awny.elmohandes@mans.edu.eg
  organization: Mansoura University,Mansoura,Egypt
– sequence: 3
  givenname: Rong
  surname: Zheng
  fullname: Zheng, Rong
  email: rzheng@mcmaster.ca
  organization: McMaster University,Hamilton,Canada
BookMark eNotjEtOwzAUAI0ECyg9ASx8gZTnv70MFaWRKkClrCvHH2QpOJGTIMHpQYXVLGY0V-g89zkgdEtgRQiYu-bl9UlwxvSKAqUrACDyDC2N0kRKwY2kXF2ifZN9-kx-tl36Tvkdb4P11T50dgoeH4rNYwwFb-bsptTnEce-4PuU7Vxsh2vXz-OUHK6HoUvOnpJrdBFtN4blPxfobfNwWG-r3fNjs653VaLApoowrgOVTChvuHGgmNXgndYxtpx46o33Kuq2dZQLEYzjGlrlQAjBvZKCLdDN3zeFEI5DSR-2fB2NphJ-7Q_t1E4H
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/IPSN54338.2022.00016
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9781665496247
166549624X
EndPage 117
ExternalDocumentID 9826065
Genre orig-research
GroupedDBID 6IE
6IL
CBEJK
RIE
RIL
ID FETCH-LOGICAL-i203t-1348e26357d949c073a80dc88ffb41d2d9dd7f8bbc2455e9c480b7c05554d7653
IEDL.DBID RIE
ISICitedReferencesCount 5
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000855254100009&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate Thu Jun 29 18:36:42 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i203t-1348e26357d949c073a80dc88ffb41d2d9dd7f8bbc2455e9c480b7c05554d7653
PageCount 13
ParticipantIDs ieee_primary_9826065
PublicationCentury 2000
PublicationDate 2022-May
PublicationDateYYYYMMDD 2022-05-01
PublicationDate_xml – month: 05
  year: 2022
  text: 2022-May
PublicationDecade 2020
PublicationTitle 2022 21st ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN)
PublicationTitleAbbrev IPSN
PublicationYear 2022
Publisher IEEE
Publisher_xml – name: IEEE
Score 1.8554622
Snippet A Head Related Transfer Function (HRTF) characterizes how a hu-man ear receives sounds from a point in space, and depends on the shapes of one's head, pinna,...
SourceID ieee
SourceType Publisher
StartPage 105
SubjectTerms Acoustic applications
Binaural Localization
Conditional Variational AutoEncoder (CVAE)
Data acquisition
Estimation
Head-Related Transfer Function (HRTF)
Location awareness
Neural networks
Sound Spatialization
Torso
Transfer functions
Title Individualizing Head-Related Transfer Functions for Binaural Acoustic Applications
URI https://ieeexplore.ieee.org/document/9826065
WOSCitedRecordID wos000855254100009&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07a4MhFL0koUOntiSlbxw61sYYv6hjWhoSKCH0Rbbg4wrf8qXk0aG_vmpCmqFLNxFEuCrnqvecA3CbFESMC0gtN44m421qYiJKueEx2SicKLKLwsezHI_VdKonNbjbcWEQMRef4X1q5r98P3fr9FTW1jEXjpBZh7qUcsPV2rLhOky3R5PXcSHilSve-niW4Uwm5nueKRkyBkf_m-wYWr_cOzLZocoJ1LBqwstox5sqv2M3GcalobmQDT3JeBPHkkEEqbyPSExFyUNZmSSqQfpunj27SH_vt7oF74Ont8ch3boh0JKzbvKMFwqTdIz0WmgXj6ZRzDulQrCi47nX3sugrHVcFAVqJxSz0iVBL-Flr-ieQqOaV3gGhPUMR4NeKu1FMEH5nnPdYDmzDIPHc2imeMw-N4IXs20oLv7uvoTDFPBNFeAVNFaLNV7DgftalcvFTV6lH9eSlug
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1NTwIxEJ0gmuhJDRi_7cGjK6V02faIRgIRN0TRcCP9mCZ7WQiCB3-9bSHIwYu3pknTZNrmTdt57wHcBgURZRwmmimTBOPtRPlENGGK-WQjNTyNLgofgyzPxXgshxW423BhEDEWn-F9aMa_fDs1y_BU1pA-F_aQuQO7KeesuWJrrflwTSob_eFbnnJ_6fL3PhaFOION-ZZrSgSN7uH_pjuC-i_7jgw3uHIMFSxr8NrfMKeKb99Nen5xkljKhpZExPFjSdfDVNxJxCej5KEoVZDVIB0zja5dpLP1X12H9-7T6LGXrP0QkoLRVnCN5wKDeExmJZfGH04lqDVCOKd50zIrrc2c0NownqYoDRdUZyZIenGbtdPWCVTLaYmnQGhbMVRoMyEtd8oJ2zam5TSjmqKzeAa1EI_JbCV5MVmH4vzv7hvY741eBpNBP3--gIMQ_FVN4CVUF_MlXsGe-VoUn_PruGI_3baaLw
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2022+21st+ACM%2FIEEE+International+Conference+on+Information+Processing+in+Sensor+Networks+%28IPSN%29&rft.atitle=Individualizing+Head-Related+Transfer+Functions+for+Binaural+Acoustic+Applications&rft.au=Zandi%2C+Navid+H.&rft.au=El-Mohandes%2C+Awny+M.&rft.au=Zheng%2C+Rong&rft.date=2022-05-01&rft.pub=IEEE&rft.spage=105&rft.epage=117&rft_id=info:doi/10.1109%2FIPSN54338.2022.00016&rft.externalDocID=9826065