Abstract / truncated to 115 words (read the full abstract)

This thesis faces the problem of automatically classifying environmental sounds, i.e., any non-speech or non-music sounds that can be found in the environment. Broadly speaking, two main processes are needed to perform such classification: the signal feature extraction so as to compose representative sound patterns and the machine learning technique that performs the classification of such patterns. The main focus of this research is put on the former, studying relevant signal features that optimally represent the sound characteristics since, according to several references, it is a key issue to attain a robust recognition. This type of audio signals holds many differences with speech or music signals, thus specific features should be determined and adapted to ... toggle 7 keywords

audio classification audio scene recognition autocorrelation function environmental sound feature extraction gammatone filters MFCC

Information

Author
Valero, Xavier
Institution
La Salle-Universitat Ramon Llull
Supervisor
Publication Year
2013
Upload Date
April 3, 2013

First few pages / click to enlarge

The current layout is optimized for mobile phones. Page previews, thumbnails, and full abstracts will remain hidden until the browser window grows in width.

The current layout is optimized for tablet devices. Page previews and some thumbnails will remain hidden until the browser window grows in width.