An environmental sound source classification system based on mel-frequency cepstral coefficients and Gaussian mixture models

Authors
Shen, G.Nguyen, Q.Choi, J.
Issue Date
2012-05
Publisher
IFAC Secretariat
Citation
14th IFAC Symposium on Information Control Problems in Manufacturing, INCOM'12, pp.1802 - 1807
Abstract
This paper proposed a study of a sound source classification system that has been developed for detecting and identifying the detected sound events in real environments. The proposed system was based on a pattern recognition approach using Gaussian mixture models and Mel-Frequency Cepstral Coefficients (MFCCs) features. We considered eight types of basic sound sources and an external sound. To make the system robust to various types of sound sources, we designed a tree of reference sound models for classification, in which especially generated total three of GMMs for external sounds according to different characteristics of frequency distributions. The performance of the proposed system, evaluated in terms of percent classification, indicated an averaged accuracy of 91.36% for off-line test. Finally, in on-line test our proposed system also showed a good and stable performance in real environments. ? 2012 IFAC.
ISSN
1474-6670
URI
https://pubs.kist.re.kr/handle/201004/80409
DOI
10.3182/20120523-3-RO-2023.00251
Appears in Collections:
KIST Conference Paper > 2012
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE