Geometry-aware sound source localization using neural networks (2025)
Abstract / truncated to 115 words
Sound Source Localization (SSL) is the topic within acoustic signal processing which studies methods for the estimation of the position of one or more active sound sources in space, such as human talkers, using signals captured by one or more microphone arrays. It has many applications, including robot orientation, speech enhancement and diarization. Although signal processing-based algorithms have been the standard choice for SSL over past decades, deep neural networks have recently achieved state-of-the-art performance for this task. A drawback of most deep learning-based SSL methods consists of requiring the training and testing microphone and room geometry to be matched, restricting practical applications of available models. This is particularly relevant when using Distributed Microphone Arrays ...
sound source localization (ssl) – deep neural networks (dnns) – machine learning (ml) – deep learning (dl) – direction-of-arrival (doa) estimation – microphone arrays
Information
- Author
- Grinstein, Eric
- Institution
- Imperial College London
- Supervisors
- Publication Year
- 2025
- Upload Date
- March 14, 2025
The current layout is optimized for mobile phones. Page previews, thumbnails, and full abstracts will remain hidden until the browser window grows in width.
The current layout is optimized for tablet devices. Page previews and some thumbnails will remain hidden until the browser window grows in width.