Sound Source Distance Estimation in Diverse and Dynamic Acoustic Conditions

Saksham Singh Kushwaha, Iran R. Roman, Magdalena Fuentes, Juan Pablo Bello

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Localizing a moving sound source in the real world involves determining its direction-of-arrival (DOA) and distance relative to a microphone. Advancements in DOA estimation have been facilitated by data-driven methods optimized with large open-source datasets with microphone array recordings in diverse environments. In contrast, estimating a sound source's distance remains understudied. Existing approaches assume recordings by non-coincident microphones to use methods that are susceptible to differences in room reverberation. We present a CRNN able to estimate the distance of moving sound sources across multiple datasets featuring diverse rooms, outperforming a recently-published approach. We also characterize our model's performance as a function of sound source distance and different training losses. This analysis reveals optimal training using a loss that weighs model errors as an inverse function of the sound source true distance. Our study is the first to demonstrate that sound source distance estimation can be performed across diverse acoustic conditions using deep learning.

Original languageEnglish (US)
Title of host publicationProceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350323726
DOIs
StatePublished - 2023
Event2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023 - New Paltz, United States
Duration: Oct 22 2023Oct 25 2023

Publication series

NameIEEE Workshop on Applications of Signal Processing to Audio and Acoustics
Volume2023-October
ISSN (Print)1931-1168
ISSN (Electronic)1947-1629

Conference

Conference2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023
Country/TerritoryUnited States
CityNew Paltz
Period10/22/2310/25/23

Keywords

  • distance estimation
  • mean percentage error
  • multichannel audio
  • sound source localization

ASJC Scopus subject areas

  • Electrical and Electronic Engineering
  • Computer Science Applications

Fingerprint

Dive into the research topics of 'Sound Source Distance Estimation in Diverse and Dynamic Acoustic Conditions'. Together they form a unique fingerprint.

Cite this