DSpace at KIST: 3D semantic image synthesis with geometric and semantic consistency

Browse

Full metadata record

DC Field	Value	Language
dc.contributor.author	Kim, Jihyun	-
dc.contributor.author	Oh, Changjae	-
dc.contributor.author	Do, Hoseok	-
dc.contributor.author	Choi, Sunghwan	-
dc.contributor.author	Sohn, Kwanghoon	-
dc.date.accessioned	2025-07-30T09:00:05Z	-
dc.date.available	2025-07-30T09:00:05Z	-
dc.date.created	2025-07-28	-
dc.date.issued	2026-01	-
dc.identifier.issn	0957-4174	-
dc.identifier.uri	https://pubs.kist.re.kr/handle/201004/152899	-
dc.description.abstract	3D semantic image synthesis generates photo-realistic and view-consistent images from a single semantic mask, which typically requires skills that apply to many practical applications like image generation, editing, and data augmentation. Existing methods for semantic image synthesis primarily focus on image reconstruction for the same view of the input, leading to artifacts when generating images from different views. To alleviate this, we propose a novel framework employing a learning-based 3D GAN inversion, which enables the generation of 3D-aware RGB images and corresponding semantic masks from a 2D single-view semantic mask. We present a Semantic Component-guided Normalization ResNet block, allowing our encoder to capture semantic representations and reflect them to the output images. To ensure semantic consistency across different views, we introduce a semantic decoder that produces an auxiliary-view semantic mask. This mask serves as a pseudo-input for learning 3D properties. Furthermore, we incorporate a 3D geometric prior that encourages the model to produce high-fidelity images from various viewpoints. Experimental results demonstrate that our method outperforms state-of-the-art 3D-aware semantic image synthesis methods.	-
dc.language	English	-
dc.publisher	Elsevier	-
dc.title	3D semantic image synthesis with geometric and semantic consistency	-
dc.type	Article	-
dc.identifier.doi	10.1016/j.eswa.2025.128782	-
dc.description.journalClass	1	-
dc.identifier.bibliographicCitation	Expert Systems with Applications, v.295	-
dc.citation.title	Expert Systems with Applications	-
dc.citation.volume	295	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.identifier.wosid	001529763200003	-
dc.identifier.scopusid	2-s2.0-105009855665	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.relation.journalWebOfScienceCategory	Operations Research & Management Science	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalResearchArea	Operations Research & Management Science	-
dc.type.docType	Article	-
dc.subject.keywordAuthor	Deep learning	-
dc.subject.keywordAuthor	Generative adversarial model	-
dc.subject.keywordAuthor	Semantic image synthesis	-
dc.subject.keywordAuthor	3D image synthesis	-

Appears in Collections:: KIST Article > Others

Export: RIS (EndNote); XLS (Excel); XML

Show Simple Item Record

KIST Library Institutional Repository

Browse

BROWSE