Applying the Bi-level HMM for Robust Voice-activity Detection
- Authors
- Hwang, Yongwon; Jeong, Mun-Ho; Oh, Sang-Rok; Kim, Il-Hwan
- Issue Date
- 2017-01
- Publisher
- 대한전기학회
- Citation
- Journal of Electrical Engineering & Technology, v.12, no.1, pp.373 - 377
- Abstract
- This paper presents a voice-activity detection (VAD) method for sound sequences with various SNRs. For real-time VAD applications, it is inadequate to employ a post-processing for the removal of burst clippings from the VAD output decision. To tackle this problem, building on the bilevel hidden Markov model, for which a state layer is inserted into a typical hidden Markov model (HMM), we formulated a robust method for VAD not requiring any additional post-processing. In the method, a forward-inference-ratio test was devised to detect the speech endpoints and Mel-frequency cepstral coefficients (MFCC) were used as the features. Our experiment results show that, regarding different SNRs, the performance of the proposed approach is more outstanding than those of the conventional methods.
- Keywords
- Voice-activity detection; Bi-level HMM; Forward-inference-ratio test
- ISSN
- 1975-0102
- URI
- https://pubs.kist.re.kr/handle/201004/123235
- DOI
- 10.5370/JEET.2017.12.1.373
- Appears in Collections:
- KIST Article > 2017
- Files in This Item:
There are no files associated with this item.
- Export
- RIS (EndNote)
- XLS (Excel)
- XML
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.