Granular and explainable human activity recognition through sound segmentation and deep learning
- Authors
- Kim, Jisoo; Yoo, Byounghyun
- Issue Date
- 2025-08
- Publisher
- 한국CDE학회
- Citation
- Journal of Computational Design and Engineering, v.12, no.8, pp.252 - 269
- Abstract
- Human Activity Recognition (HAR) plays a crucial role in identifying and digitizing human behaviors. Among various approaches, sound-based HAR offers distinct advantages, such as overcoming visual limitations and enabling recognition in diverse environments. This study introduces an innovative application of sound segmentation with SegNet, originally designed for image segmentation, to sound-based HAR. Traditionally, labeling sound data has been challenging due to its limited scope, often restricted to specific events or time frames. To address this issue, a novel labeling approach was developed, allowing detailed annotations across the entire temporal and frequency domains. This method facilitates the use of SegNet, which requires pixel-level labeling for accurate segmentation, leading to more granular and explainable activity recognition. A dataset comprising six distinct human activities-speech, groaning, screaming, coughing, toilet and snoring-was constructed to enable comprehensive evaluation. The trained neural network, utilizing this annotated dataset, achieved F1 scores ranging from 0.68 to 0.95. The model's practical applicability was further validated through recognition tests conducted in a professional office environment. This study presents a novel framework for quantifying daily human activities through sound segmentation, contributing to advancements in intelligent system technology.
- Keywords
- EVENT DETECTION; FALL DETECTION; CLASSIFICATION; LOCALIZATION; AUDIO; CNN; Sound Segmentation; Human Activity Recognition; Sound Data Labeling; Explainable Artificial Intelligence
- ISSN
- 2288-4300
- URI
- https://pubs.kist.re.kr/handle/201004/153118
- DOI
- 10.1093/jcde/qwaf075
- Appears in Collections:
- KIST Article > Others
- Files in This Item:
There are no files associated with this item.
- Export
- RIS (EndNote)
- XLS (Excel)
- XML
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.