Full metadata record

DC Field Value Language
dc.contributor.authorKim, Jihyun-
dc.contributor.authorOh, Changjae-
dc.contributor.authorDo, Hoseok-
dc.contributor.authorChoi, Sunghwan-
dc.contributor.authorSohn, Kwanghoon-
dc.date.accessioned2025-07-30T09:00:05Z-
dc.date.available2025-07-30T09:00:05Z-
dc.date.created2025-07-28-
dc.date.issued2026-01-
dc.identifier.issn0957-4174-
dc.identifier.urihttps://pubs.kist.re.kr/handle/201004/152899-
dc.description.abstract3D semantic image synthesis generates photo-realistic and view-consistent images from a single semantic mask, which typically requires skills that apply to many practical applications like image generation, editing, and data augmentation. Existing methods for semantic image synthesis primarily focus on image reconstruction for the same view of the input, leading to artifacts when generating images from different views. To alleviate this, we propose a novel framework employing a learning-based 3D GAN inversion, which enables the generation of 3D-aware RGB images and corresponding semantic masks from a 2D single-view semantic mask. We present a Semantic Component-guided Normalization ResNet block, allowing our encoder to capture semantic representations and reflect them to the output images. To ensure semantic consistency across different views, we introduce a semantic decoder that produces an auxiliary-view semantic mask. This mask serves as a pseudo-input for learning 3D properties. Furthermore, we incorporate a 3D geometric prior that encourages the model to produce high-fidelity images from various viewpoints. Experimental results demonstrate that our method outperforms state-of-the-art 3D-aware semantic image synthesis methods.-
dc.languageEnglish-
dc.publisherElsevier-
dc.title3D semantic image synthesis with geometric and semantic consistency-
dc.typeArticle-
dc.identifier.doi10.1016/j.eswa.2025.128782-
dc.description.journalClass3-
dc.identifier.bibliographicCitationExpert Systems with Applications, v.295-
dc.citation.titleExpert Systems with Applications-
dc.citation.volume295-
dc.description.isOpenAccessN-
dc.identifier.wosid001529763200003-
dc.identifier.scopusid2-s2.0-105009855665-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryOperations Research & Management Science-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaOperations Research & Management Science-
dc.type.docTypeArticle-
dc.subject.keywordAuthorDeep learning-
dc.subject.keywordAuthorGenerative adversarial model-
dc.subject.keywordAuthorSemantic image synthesis-
dc.subject.keywordAuthor3D image synthesis-
Appears in Collections:
KIST Article > Others
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE