Deep Learning for Distant Speech Recognition (2017)
Abstract / truncated to 115 words
Deep learning is an emerging technology that is considered one of the most promising directions for reaching higher levels of artificial intelligence. Among the other achievements, building computers that understand speech represents a crucial leap towards intelligent machines. Despite the great efforts of the past decades, however, a natural and robust human-machine speech interaction still appears to be out of reach, especially when users interact with a distant microphone in noisy and reverberant environments. The latter disturbances severely hamper the intelligibility of a speech signal, making Distant Speech Recognition (DSR) one of the major open challenges in the field. This thesis addresses the latter scenario and proposes some novel techniques, architectures, and algorithms to improve ...
deep learning – deep neural networks – distant speech recognition – far-field speech reocognition – ASR
Information
- Author
- Ravanelli, Mirco
- Institution
- Fondazione Bruno Kessler
- Supervisor
- Publication Year
- 2017
- Upload Date
- Dec. 17, 2017
The current layout is optimized for mobile phones. Page previews, thumbnails, and full abstracts will remain hidden until the browser window grows in width.
The current layout is optimized for tablet devices. Page previews and some thumbnails will remain hidden until the browser window grows in width.