Deep Learning for Distant Speech Recognition

Abstract / truncated to 115 words (read the full abstract)

Deep learning is an emerging technology that is considered one of the most promising directions for reaching higher levels of artificial intelligence. Among the other achievements, building computers that understand speech represents a crucial leap towards intelligent machines. Despite the great efforts of the past decades, however, a natural and robust human-machine speech interaction still appears to be out of reach, especially when users interact with a distant microphone in noisy and reverberant environments. The latter disturbances severely hamper the intelligibility of a speech signal, making Distant Speech Recognition (DSR) one of the major open challenges in the field. This thesis addresses the latter scenario and proposes some novel techniques, architectures, and algorithms to improve ... toggle 5 keywords
deep learning – deep neural networks – distant speech recognition – far-field speech reocognition – ASR

Information

Author

Ravanelli, Mirco

Institution

Fondazione Bruno Kessler

Supervisor

Maurizio Omologo

Publication Year

2017

Upload Date

Dec. 17, 2017

The current layout is optimized for mobile phones. Page previews, thumbnails, and full abstracts will remain hidden until the browser window grows in width.

The current layout is optimized for tablet devices. Page previews and some thumbnails will remain hidden until the browser window grows in width.

Deep Learning for Distant Speech Recognition (2017)

Abstract / truncated to 115 words (read the full abstract)

Information

First few pages / click to enlarge