Full metadata record

DC Field Value Language
dc.contributor.authorHeo, Seokhyeon-
dc.contributor.authorUhm, Kyeong Eun-
dc.contributor.authorYuk, Doyoung-
dc.contributor.authorKwon, Bo Mi-
dc.contributor.authorYoo, Byounghyun-
dc.contributor.authorKim, Jisoo-
dc.contributor.authorLee, Jongmin-
dc.date.accessioned2024-09-14T06:30:35Z-
dc.date.available2024-09-14T06:30:35Z-
dc.date.created2024-09-13-
dc.date.issued2024-08-
dc.identifier.urihttps://pubs.kist.re.kr/handle/201004/150587-
dc.description.abstractDysphagia, a disorder affecting the ability to swallow, has a high prevalence among the older adults and can lead to serious health complications. Therefore, early detection of dysphagia is important. This study evaluated the effectiveness of a newly developed deep learning model that analyzes syllable-segmented data for diagnosing dysphagia, an aspect not addressed in prior studies. The audio data of daily conversations were collected from 16 patients with dysphagia and 24 controls. The presence of dysphagia was determined by videofluoroscopic swallowing study. The data were segmented into syllables using a speech-to-text model and analyzed with a convolutional neural network to perform binary classification between the dysphagia patients and control group. The proposed model in this study was assessed in two different aspects. Firstly, with syllable-segmented analysis, it demonstrated a diagnostic accuracy of 0.794 for dysphagia, a sensitivity of 0.901, a specificity of 0.687, a positive predictive value of 0.742, and a negative predictive value of 0.874. Secondly, at the individual level, it achieved an overall accuracy of 0.900 and area under the curve of 0.953. This research highlights the potential of deep learning modal as an early, non-invasive, and simple method for detecting dysphagia in everyday environments.-
dc.languageEnglish-
dc.publisherNature Publishing Group-
dc.titleDeep learning approach for dysphagia detection by syllable-based speech analysis with daily conversations-
dc.typeArticle-
dc.identifier.doi10.1038/s41598-024-70774-z-
dc.description.journalClass1-
dc.identifier.bibliographicCitationScientific Reports, v.14, no.1-
dc.citation.titleScientific Reports-
dc.citation.volume14-
dc.citation.number1-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.identifier.wosid001304007200022-
dc.identifier.scopusid2-s2.0-85202872228-
dc.relation.journalWebOfScienceCategoryMultidisciplinary Sciences-
dc.relation.journalResearchAreaScience & Technology - Other Topics-
dc.type.docTypeArticle-
dc.subject.keywordPlusASPIRATION-
dc.subject.keywordPlusVOICE-
dc.subject.keywordAuthorDysphagia-
dc.subject.keywordAuthorDeep learning-
dc.subject.keywordAuthorConversations-
dc.subject.keywordAuthorSyllable-based speech analysis-
dc.subject.keywordAuthorSpeech-to-text model-
dc.subject.keywordAuthorArtificial intelligence-
Appears in Collections:
KIST Article > 2024
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE