Full metadata record

DC Field Value Language
dc.contributor.authorKim, Soomin-
dc.contributor.authorPark, Juyoun-
dc.date.accessioned2024-06-25T04:32:34Z-
dc.date.available2024-06-25T04:32:34Z-
dc.date.created2024-06-25-
dc.date.issued2024-06-
dc.identifier.issn2169-3536-
dc.identifier.urihttps://pubs.kist.re.kr/handle/201004/150116-
dc.description.abstractWe, humans, perceive the scene utilizing pre-learned language categories. Our vocabulary system inherently possesses a hierarchy, aiding humans in understanding scenes at multiple levels. For example, when a person passes by chairs and desks from a distance rather than interacting with them up close, the objects are perceived from a broader perspective and recognized as furniture at a higher category level. In this work, we propose a multi-level semantic segmentation data generation method based on a scene-specific word tree to mimic human multi-level scene recognition. Multi-level semantic segmentation data encompasses diverse levels of grouped segmented areas with different degrees of detail, from the finest level of conventional semantic segmentation to coarser levels. Our scene-specific word trees leverage linguistic hierarchies to group scene components by considering relationships between words present in the scene. Furthermore, in the proposed data generation method, each word tree is constructed within a single image, allowing us to group the objects into user-selected levels, taking into account the relative relationship between objects in that scene. We demonstrate the effectiveness of our data generation method by building a multi-level scene segmentation network and training the model with the generated dataset, which reflects the scene-specific word tree.-
dc.languageEnglish-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleMulti-Level Segmentation Data Generation based on a Scene-Specific Word Tree-
dc.typeArticle-
dc.identifier.doi10.1109/access.2024.3418515-
dc.description.journalClass1-
dc.identifier.bibliographicCitationIEEE Access, v.12, pp.88202 - 88215-
dc.citation.titleIEEE Access-
dc.citation.volume12-
dc.citation.startPage88202-
dc.citation.endPage88215-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.identifier.wosid001258792500001-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryTelecommunications-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaTelecommunications-
dc.type.docTypeArticle-
dc.subject.keywordAuthorSemantics-
dc.subject.keywordAuthorVisualization-
dc.subject.keywordAuthorSemantic segmentation-
dc.subject.keywordAuthorTraining-
dc.subject.keywordAuthorImage recognition-
dc.subject.keywordAuthorData models-
dc.subject.keywordAuthorSegmentation-
dc.subject.keywordAuthorsemantic grouping-
dc.subject.keywordAuthorlanguage hierarchy-
dc.subject.keywordAuthordataset generation-
dc.subject.keywordAuthormulti-level analysis-
dc.subject.keywordAuthorImage segmentation-
Appears in Collections:
KIST Article > 2024
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE