Can a Phone Hear the Shape of a Room?
| Published in: | Proceedings of the 18th International Conference on Information Processing in Sensor Networks pp. 277 - 288 |
|---|---|
| Main Authors: | Shih, Oliver; Rowe, Anthony |
| Format: | Conference Proceeding |
| Language: | English |
| Published: | ACM, 01.04.2019 |
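| Subjects: | Absorption; Acoustics; Active acoustic sensing; Geometry; Image reconstruction; Microphones; room reconstruction and mapping; Surface reconstruction |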
| Abstract | Understanding the location of acoustically reflective surfaces in a room is a critical component in advanced sound processing. For example, intelligent speakers can use a room's acoustic geometry to improve playback quality, source separation accuracy, and speech recognition. In this paper, we present Synesthesia, a system for capturing the acoustic properties of a room using a single fixed speaker and a mobile phone that records audio at multiple locations. Using the arrival time of echoes, the system reconstructs the position of reflective surfaces such as walls and then estimates properties such as surface absorption. Previous work has shown how the acoustic room impulse response (RIR) of an environment can be used to analyze echoes within a space and reconstruct room geometry. The best current RIR-based approaches rely on high-end equipment and on capturing an acoustic signal broadcast into space from a known fixed constellation of microphones. They also require precise calibration and measurement of microphone positions. In addition, most approaches constrain the room geometry and limit the order of the RIR to achieve accurate and consistent results. We introduce a new approach that performs RIR imaging using a mobile phone that tracks its location with visual inertial odometry (VIO) to record a dense set of samples, albeit with noise in their locations. This approach relaxes several key assumptions on the RIR, and we show through both experimentation and simulation that even with 20 cm of uncertainty in the microphone locations provided by VIO, we are still able to reconstruct the room geometry with accurate shape and dimensions. We demonstrate this capability by prototyping a tool for acoustic engineers that allows a user to view a room's estimated geometry and absorption overlaid on the actual sensed space with augmented reality. |
| Author | Shih, Oliver; Rowe, Anthony |
| Author affiliations | Oliver Shih (Carnegie Mellon University, Pittsburgh, PA); Anthony Rowe (Carnegie Mellon University, Pittsburgh, PA) |
| DOI | 10.1145/3302506.3310407 |
| EISBN | 9781450362849 1450362842 |
| OpenAccessLink | https://dl.acm.org/doi/pdf/10.1145/3302506.3310407 |
| PageCount | 12 |
| PublicationTitleAbbrev | IPSN |
| SubjectTerms | Absorption Acoustics Active acoustic sensing Geometry Image reconstruction Microphones room reconstruction and mapping Surface reconstruction |
| URI | https://ieeexplore.ieee.org/document/8732504 |