Sparse Pulsed Auditory Representations For Speech and Audio Coding (2005)
Abstract / truncated to 115 words
Auditory modeling is a well-established methodology that provides insight into human perception and that facilitates the extraction of signal features most relevant to the human listener for coding applications. This thesis deals with the approach of 'coding in the perceptual domain' and is based on an invertible auditory model that provides a pulsed auditory representation of the input speech or audio signal. It is natural for pulsed signal representations to encode only the non-zero samples by specifying their positions as side information. For the considered auditory representation, the number of pulses and, therefore, the amount of side information is too high for an efficient encoding at a relatively low bit rate. The focus of this ...
masking – model inversion – filterbank
Information
- Author: Christian Feldbauer
- Institution: Graz University of Technology
- Supervisor:
- Publication Year: 2005
- Upload Date: Aug. 28, 2015