Full metadata record

DC Field    Value    Language
dc.contributor.author    Kim, Jisoo    -
dc.contributor.author    Yoo, Byounghyun    -
dc.date.accessioned    2025-09-04T02:30:25Z    -
dc.date.available    2025-09-04T02:30:25Z    -
dc.date.created    2025-09-04    -
dc.date.issued    2025-08    -
dc.identifier.issn    2288-4300    -
dc.identifier.uri    https://pubs.kist.re.kr/handle/201004/153118    -
dc.description.abstract    Human Activity Recognition (HAR) plays a crucial role in identifying and digitizing human behaviors. Among various approaches, sound-based HAR offers distinct advantages, such as overcoming visual limitations and enabling recognition in diverse environments. This study introduces an innovative application of sound segmentation with SegNet, originally designed for image segmentation, to sound-based HAR. Traditionally, labeling sound data has been challenging due to its limited scope, often restricted to specific events or time frames. To address this issue, a novel labeling approach was developed, allowing detailed annotations across the entire temporal and frequency domains. This method facilitates the use of SegNet, which requires pixel-level labeling for accurate segmentation, leading to more granular and explainable activity recognition. A dataset comprising six distinct human activities (speech, groaning, screaming, coughing, toilet, and snoring) was constructed to enable comprehensive evaluation. The trained neural network, utilizing this annotated dataset, achieved F1 scores ranging from 0.68 to 0.95. The model's practical applicability was further validated through recognition tests conducted in a professional office environment. This study presents a novel framework for quantifying daily human activities through sound segmentation, contributing to advancements in intelligent system technology.    -
dc.language    English    -
dc.publisher    한국CDE학회    -
dc.title    Granular and explainable human activity recognition through sound segmentation and deep learning    -
dc.type    Article    -
dc.identifier.doi    10.1093/jcde/qwaf075    -
dc.description.journalClass    1    -
dc.identifier.bibliographicCitation    Journal of Computational Design and Engineering, v.12, no.8, pp.252 - 269    -
dc.citation.title    Journal of Computational Design and Engineering    -
dc.citation.volume    12    -
dc.citation.number    8    -
dc.citation.startPage    252    -
dc.citation.endPage    269    -
dc.description.isOpenAccess    Y    -
dc.description.journalRegisteredClass    scie    -
dc.description.journalRegisteredClass    scopus    -
dc.identifier.wosid    001555098700001    -
dc.identifier.scopusid    2-s2.0-105014025877    -
dc.relation.journalWebOfScienceCategory    Computer Science, Interdisciplinary Applications    -
dc.relation.journalWebOfScienceCategory    Engineering, Multidisciplinary    -
dc.relation.journalResearchArea    Computer Science    -
dc.relation.journalResearchArea    Engineering    -
dc.type.docType    Article    -
dc.subject.keywordPlus    EVENT DETECTION    -
dc.subject.keywordPlus    FALL DETECTION    -
dc.subject.keywordPlus    CLASSIFICATION    -
dc.subject.keywordPlus    LOCALIZATION    -
dc.subject.keywordPlus    AUDIO    -
dc.subject.keywordPlus    CNN    -
dc.subject.keywordAuthor    Sound Segmentation    -
dc.subject.keywordAuthor    Human Activity Recognition    -
dc.subject.keywordAuthor    Sound Data Labeling    -
dc.subject.keywordAuthor    Explainable Artificial Intelligence    -
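
Note on the approach described in the abstract above: the paper treats sound as an image by labeling and segmenting every time-frequency bin of a spectrogram with a SegNet-style encoder-decoder. The sketch below is an illustrative, hypothetical rendering of that general idea only; it is not the authors' implementation, and the library choices (librosa, PyTorch), the log-mel front end, the tiny network size, and the added "background" class are assumptions made for illustration.

# Minimal sketch (assumed details, not the published model): log-mel
# spectrogram in, per-bin activity class out.
import librosa
import numpy as np
import torch
import torch.nn as nn

# Six activities from the abstract plus an assumed background class.
CLASSES = ["background", "speech", "groaning", "screaming",
           "coughing", "toilet", "snoring"]

def audio_to_logmel(path, sr=16000, n_mels=64):
    """Load audio and convert it to a log-mel spectrogram 'image'."""
    y, sr = librosa.load(path, sr=sr)
    mel = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=n_mels)
    return librosa.power_to_db(mel, ref=np.max)  # shape: (n_mels, n_frames)

class MiniSegNet(nn.Module):
    """Tiny SegNet-like block: max-pooling indices from the encoder are
    reused for unpooling in the decoder, then a 1x1 conv gives per-bin
    class scores over the full time-frequency grid."""
    def __init__(self, n_classes=len(CLASSES)):
        super().__init__()
        self.enc = nn.Sequential(nn.Conv2d(1, 16, 3, padding=1),
                                 nn.BatchNorm2d(16), nn.ReLU())
        self.pool = nn.MaxPool2d(2, stride=2, return_indices=True)
        self.unpool = nn.MaxUnpool2d(2, stride=2)
        self.dec = nn.Sequential(nn.Conv2d(16, 16, 3, padding=1),
                                 nn.BatchNorm2d(16), nn.ReLU(),
                                 nn.Conv2d(16, n_classes, 1))

    def forward(self, x):            # x: (batch, 1, n_mels, n_frames)
        z = self.enc(x)
        z, idx = self.pool(z)
        z = self.unpool(z, idx)
        return self.dec(z)           # (batch, n_classes, n_mels, n_frames)

# Per-bin argmax yields a segmentation mask; inspecting which bins carry
# which class is what makes the recognition granular and explainable.
model = MiniSegNet()
spec = torch.randn(1, 1, 64, 128)    # stand-in for a log-mel spectrogram batch
mask = model(spec).argmax(dim=1)     # (1, 64, 128) class id per bin

Reusing the encoder's pooling indices in the decoder, rather than learned upsampling, is the signature SegNet design choice this sketch mirrors; a full model would stack several such encoder-decoder stages and train with a per-bin cross-entropy loss against the time-frequency annotations described in the abstract.
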
Appears in Collections:
KIST Article > Others
Files in This Item:
There are no files associated with this item.