Perceptually Motivated Speech Enhancement

Abstract / truncated to 115 words (read the full abstract)

Speech Enhancement (SE) is a vital technology for online human communication. Applications of Deep Neural Network (DNN) technologies in concert with traditional signal processing approaches to the task have revolutionised both the research and implementation of SE in recent years. However, the training objective of these Neural Network Speech Enhancement (NNSE) systems generally do not consider the psychoacoustic processing which occurs in the human auditory system. As a result, enhanced audio can often contain auditory artefacts which degrade the perceptual quality or intelligibility of the speech. To overcome this, systems which directly incorporate psychoacoustically motivated measures into the training objectives of NNSE systems have been proposed. A key development in speech audio processing in recent ... toggle 5 keywords
speech enhancement – neural networks – artificial intelligence – speech quality – speech intelligibility

Information

Author

Close, George

Institution

University of Sheffield

Supervisors

Publication Year

2025

Upload Date

April 3, 2025

The current layout is optimized for mobile phones. Page previews, thumbnails, and full abstracts will remain hidden until the browser window grows in width.

The current layout is optimized for tablet devices. Page previews and some thumbnails will remain hidden until the browser window grows in width.

Perceptually Motivated Speech Enhancement (2025)

Abstract / truncated to 115 words (read the full abstract)

Information

First few pages / click to enlarge