Sparse Pulsed Auditory Representations For Speech and Audio Coding

Abstract / truncated to 115 words (read the full abstract)

Auditory modeling is a well-established methodology that provides insight into human perception and that facilitates the extraction of signal features most relevant to the human listener for coding applications. This thesis deals with the approach of `coding in the perceptual domain' and is based on an invertible auditory model that provides a pulsed auditory representation of the input speech or audio signal. It is natural for pulsed signal representations to encode only the non-zero samples by specifying their positions as side information. For the considered auditory representation, the number of pulses and, therefore, the amount of side information is too high for an efficient encoding at a relatively low bit rate. The focus of this ... toggle 3 keywords
masking – model inversion – filterbank

Information

Author

Christian Feldbauer

Institution

Graz University of Technology

Supervisor

Gernot Kubin

Publication Year

2005

Upload Date

Aug. 28, 2015

The current layout is optimized for mobile phones. Page previews, thumbnails, and full abstracts will remain hidden until the browser window grows in width.

The current layout is optimized for tablet devices. Page previews and some thumbnails will remain hidden until the browser window grows in width.

Follow @eurasip

Sparse Pulsed Auditory Representations For Speech and Audio Coding (2005)

Abstract / truncated to 115 words (read the full abstract)

Information

First few pages / click to enlarge