Can a Phone Hear the Shape of a Room?

Understanding the location of acoustically reflective surfaces in a room is a critical component in advanced sound processing. For example, intelligent speakers can use a room's acoustic geometry to improve playback quality, source separation accuracy, and speech recognition. In this paper, we...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Proceedings of the 18th International Conference on Information Processing in Sensor Networks S. 277 - 288
Hauptverfasser: Shih, Oliver, Rowe, Anthony
Format: Tagungsbericht
Sprache:Englisch
Veröffentlicht: ACM 01.04.2019
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Abstract Understanding the location of acoustically reflective surfaces in a room is a critical component in advanced sound processing. For example, intelligent speakers can use a room's acoustic geometry to improve playback quality, source separation accuracy, and speech recognition. In this paper, we present Synesthesia, a system for capturing the acoustic properties of a room using a single fixed speaker and a mobile phone that records audio at multiple locations. Using the arrival time of echoes, the system is able to reconstruct the position of reflective surfaces like walls and then estimate properties like surface absorption. Previous work has shown how the acoustic room impulse response (RIR) of an environment can be used to analyze echoes within a space to reconstruct room geometry. The best current RIR-based approaches rely on high-end equipment and capturing an acoustic signal broadcast into space from a known fixed constellation of microphones. They also require the precise calibration and measurement of microphone positions. In addition, most approaches pose constraints on room geometries and limit the order of RIR to achieve accurate and consistent results. In this paper, we introduce a new approach that performs RIR imaging using a mobile phone that tracks its location with visual inertial odometry (VIO) to record a dense set of samples albeit with noise in their locations. We present a new approach that is able to relax several key assumptions on RIR and show through both experimentation and simulation that even with 20cm of uncertainty in the microphone locations provided by VIO, we are still able to reconstruct the room geometry with accurate shape and dimensions. We demonstrate this capability by prototyping a tool for acoustic engineers, that allows a user to view a room's estimated geometry and absorption overlaid on the actual sensed space with augmented reality.
AbstractList Understanding the location of acoustically reflective surfaces in a room is a critical component in advanced sound processing. For example, intelligent speakers can use a room's acoustic geometry to improve playback quality, source separation accuracy, and speech recognition. In this paper, we present Synesthesia, a system for capturing the acoustic properties of a room using a single fixed speaker and a mobile phone that records audio at multiple locations. Using the arrival time of echoes, the system is able to reconstruct the position of reflective surfaces like walls and then estimate properties like surface absorption. Previous work has shown how the acoustic room impulse response (RIR) of an environment can be used to analyze echoes within a space to reconstruct room geometry. The best current RIR-based approaches rely on high-end equipment and capturing an acoustic signal broadcast into space from a known fixed constellation of microphones. They also require the precise calibration and measurement of microphone positions. In addition, most approaches pose constraints on room geometries and limit the order of RIR to achieve accurate and consistent results. In this paper, we introduce a new approach that performs RIR imaging using a mobile phone that tracks its location with visual inertial odometry (VIO) to record a dense set of samples albeit with noise in their locations. We present a new approach that is able to relax several key assumptions on RIR and show through both experimentation and simulation that even with 20cm of uncertainty in the microphone locations provided by VIO, we are still able to reconstruct the room geometry with accurate shape and dimensions. We demonstrate this capability by prototyping a tool for acoustic engineers, that allows a user to view a room's estimated geometry and absorption overlaid on the actual sensed space with augmented reality.
Author Shih, Oliver
Rowe, Anthony
Author_xml – sequence: 1
  givenname: Oliver
  surname: Shih
  fullname: Shih, Oliver
  organization: Carnegie Mellon University, Pittsburgh, PA
– sequence: 2
  givenname: Anthony
  surname: Rowe
  fullname: Rowe, Anthony
  organization: Carnegie Mellon University, Pittsburgh, PA
BookMark eNotj89LxDAQRiMoqGvPHrzk4rHrJDNpJieR4rrCguKP8zJtE7ritku7F_97A3r6Dg8e77tUp8M4RKWuDSyNIXeHCNZBtUQ0QOBPVBE8ZwBYWaZwrop5_gIAy47JhAt1W8ugRb_22aPXUSZ97KN-7-UQ9ZgyeRvH_f2VOkvyPcfifxfqc_X4Ua_LzcvTc_2wKcUyH0ukJjgJHbOId8kalwipa8RxIm6rlEu6AI2YzoAXqTiSw5iSE9962-JC3fx5dzHG7WHa7WX62bLHfIrwF9o7Pa0
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1145/3302506.3310407
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 9781450362849
1450362842
EndPage 288
ExternalDocumentID 8732504
Genre orig-research
GroupedDBID 6IE
6IL
ACM
ALMA_UNASSIGNED_HOLDINGS
APO
CBEJK
GUFHI
LHSKQ
RIE
RIL
ID FETCH-LOGICAL-a288t-34b95a9d88aa75f215f434dba58f48c6f145d90ba1d107aa68e453eff5a7c72c3
IEDL.DBID RIE
ISICitedReferencesCount 9
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000474338900024&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
IngestDate Wed Aug 27 06:03:22 EDT 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a288t-34b95a9d88aa75f215f434dba58f48c6f145d90ba1d107aa68e453eff5a7c72c3
OpenAccessLink https://dl.acm.org/doi/pdf/10.1145/3302506.3310407
PageCount 12
ParticipantIDs ieee_primary_8732504
PublicationCentury 2000
PublicationDate 2019-April
PublicationDateYYYYMMDD 2019-04-01
PublicationDate_xml – month: 04
  year: 2019
  text: 2019-April
PublicationDecade 2010
PublicationTitle Proceedings of the 18th International Conference on Information Processing in Sensor Networks
PublicationTitleAbbrev IPSN
PublicationYear 2019
Publisher ACM
Publisher_xml – name: ACM
SSID ssj0002858419
Score 1.7771319
Snippet Understanding the location of acoustically reflective surfaces in a room is a critical component in advanced sound processing. For example, intelligent...
SourceID ieee
SourceType Publisher
StartPage 277
SubjectTerms Absorption
Acoustics
Active acoustic sensing
Geometry
Image reconstruction
Microphones
room reconstruction and mapping
Surface reconstruction
Title Can a Phone Hear the Shape of a Room?
URI https://ieeexplore.ieee.org/document/8732504
WOSCitedRecordID wos000474338900024&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV07SwNBEB5isLBSScQ3W2jnJXe378oiGFJICD4gXZh9EZtLiIm_37lLiAg2dsvuwjDMsPPN7DwA7kgNsLRRZtwVKhNlKDKX6DGUXhbOojAiNJJ-1uOxmU7tpAUP-1qYGGOTfBZ79bL5yw8Lv6lDZX2jed1x6wAOtFbbWq19PKU0ZEoLu-veUwjZJ0-dbqseJwQj8t_jUxrrMTz-H90T6P6U4bHJ3sCcQitWHbgfYMWQTeaLKrIR6SkjCMde57iMbJHo5IWg8GMX3odPb4NRtpt1kGFpzDrjwlmJNhiDqGUiQ5wEF8GhNEkYrxJxFGzusAjksCEqE4XkMSWJ2uvS8zNoV0T4HJjmyloZYu6cJudX2BoGOGd8joXzKlxAp2Zxtty2s5jtuLv8e_sKjggj2G2yyjW016tNvIFD_7X--FzdNjL4Bv9PhRI
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1LSwMxEB5qFfSk0opvc9Cb2-5ukt3k5KFYKtZStEJvZfKiXraltv5-Z7elInjxFpLAMMyQ-WYyD4BbUgNMtZcRN0kWidQlkQn0GEorE6NRKOEqSffzwUCNx3pYg_ttLYz3vko-861yWf3lu5ldlaGytsp52XFrB3alEGm8rtbaRlRSRcY00Zv-PYmQbfLV6X7W4oRhRPx7gEplP7qH_6N8BM2fQjw23JqYY6j5ogF3HSwYsuF0VnjWI01lBOLY2xTnns0CnbwSGH5ownv3cdTpRZtpBxGmSi0jLoyWqJ1SiLkMZIqD4MIZlCoIZbNAHDkdG0wcuWyImfJCch-CxNzmqeUnUC-I8CmwnGdaS-djY3Jyf4UugYAxysaYGJu5M2iULE7m64YWkw13539v38B-b_TSn_SfBs8XcECIQa9TVy6hvlys_BXs2a_lx-fiupLHNyLTiFk
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=Proceedings+of+the+18th+International+Conference+on+Information+Processing+in+Sensor+Networks&rft.atitle=Can+a+Phone+Hear+the+Shape+of+a+Room%3F&rft.au=Shih%2C+Oliver&rft.au=Rowe%2C+Anthony&rft.date=2019-04-01&rft.pub=ACM&rft.spage=277&rft.epage=288&rft_id=info:doi/10.1145%2F3302506.3310407&rft.externalDocID=8732504