Discrete-time speech processing with application to emotion recognition

Abstract

The subject of this PhD thesis is the efficient and robust processing and analysis of audio recordings derived from a call center. The thesis comprises two parts. The first part is dedicated to dialogue/non-dialogue detection and to speaker segmentation. The systems developed are prerequisites for detecting (i) the audio segments that actually contain a dialogue between the system and the call center customer and (ii) the change points between the system and the customer. This way, the volume of audio recordings that need to be processed is significantly reduced, while the system is automated. To detect the presence of a dialogue, several systems are developed. This is ...
Keywords: speech processing – emotion recognition – speaker segmentation – dialogue detection – machine learning

Information

Author

Kotti, Margarita

Institution

Aristotle University of Thessaloniki

Supervisor

Constantine Kotropoulos

Publication Year

2009

Upload Date

April 3, 2011

