A Geometric Deep Learning Approach to Sound Source Localization and Tracking

Abstract / truncated to 115 words (read the full abstract)

The localization and tracking of sound sources using microphone arrays is a problem that, even if it has attracted attention from the signal processing research community for decades, remains open. In recent years, deep learning models have surpassed the state-of-the-art that had been established by classic signal processing techniques, but these models still struggle with handling rooms with strong reverberations or tracking multiple sources that dynamically appear and disappear, especially when we cannot apply any criteria to classify or order them. In this thesis, we follow the ideas of the Geometric Deep Learning framework to propose new models and techniques that mean an advance of the state-of-the-art in the aforementioned scenarios. As the input of ... toggle 6 keywords
deep learning – microphone arrays – audio signal processing – localization – tracking – sound source localization

Information

Author

Diaz-Guerra, David

Institution

University of Zaragoza

Supervisors

Publication Year

2023

Upload Date

Sept. 25, 2023

The current layout is optimized for mobile phones. Page previews, thumbnails, and full abstracts will remain hidden until the browser window grows in width.

The current layout is optimized for tablet devices. Page previews and some thumbnails will remain hidden until the browser window grows in width.

A Geometric Deep Learning Approach to Sound Source Localization and Tracking (2023)

Abstract / truncated to 115 words (read the full abstract)

Information

First few pages / click to enlarge