Applying the Bi-level HMM for Robust Voice-activity Detection

Authors
Hwang, YongwonJeong, Mun-HoOh, Sang-RokKim, Il-Hwan
Issue Date
2017-01
Publisher
대한전기학회
Citation
Journal of Electrical Engineering & Technology, v.12, no.1, pp.373 - 377
Abstract
This paper presents a voice-activity detection (VAD) method for sound sequences with various SNRs. For real-time VAD applications, it is inadequate to employ a post-processing for the removal of burst clippings from the VAD output decision. To tackle this problem, building on the bilevel hidden Markov model, for which a state layer is inserted into a typical hidden Markov model (HMM), we formulated a robust method for VAD not requiring any additional post-processing. In the method, a forward-inference-ratio test was devised to detect the speech endpoints and Mel-frequency cepstral coefficients (MFCC) were used as the features. Our experiment results show that, regarding different SNRs, the performance of the proposed approach is more outstanding than those of the conventional methods.
Keywords
Voice-activity detection; Bi-level HMM; Forward-inference-ratio test
ISSN
1975-0102
URI
https://pubs.kist.re.kr/handle/201004/123235
DOI
10.5370/JEET.2017.12.1.373
Appears in Collections:
KIST Article > 2017
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE