Data-driven Speech Enhancement: from Non-negative Matrix Factorization to Deep Representation Learning

Abstract

In natural listening environments, speech signals are easily distorted by various acoustic interference, which reduces speech quality and intelligibility for human listeners; it also creates difficulties for many speech-related applications, such as automatic speech recognition (ASR). Thus, many speech enhancement (SE) algorithms have been developed in the past decades. However, most current SE algorithms have difficulty capturing underlying speech information (e.g., phonemes) in the SE process. This makes it challenging to know what specific information is lost or interfered with in the SE process, which limits the applicability of the enhanced speech. For instance, some SE algorithms aimed at improving human listening often degrade the performance of ASR systems. The objective of this dissertation is ...
speech enhancement – non-negative matrix factorization – deep representation learning

Information

Author

Xiang, Yang

Institution

Aalborg University, Capturi A/S

Supervisors

Publication Year

2023

Upload Date

Feb. 14, 2023
