3D-C2FT: Coarse-to-Fine Transformer for Multi-view 3D Reconstruction

Authors
Tiong, Leslie Ching OwSigmund, DickTeoh, Andrew Beng Jin
Issue Date
2022-12
Publisher
SPRINGER INTERNATIONAL PUBLISHING AG
Citation
16th Asian Conference on Computer Vision (ACCV), pp.211 - 227
Abstract
Recently, the transformer model has been successfully employed for the multi-view 3D reconstruction problem. However, challenges remain in designing an attention mechanism to explore the multi-view features and exploit their relations for reinforcing the encoding-decoding modules. This paper proposes a new model, namely 3D coarse-to-fine transformer (3D-C2FT), by introducing a novel coarse-to-fine (C2F) attention mechanism for encoding multi-view features and rectifying defective voxel-based 3D objects. C2F attention mechanism enables the model to learn multi-view information flow and synthesize 3D surface correction in a coarse to fine-grained manner. The proposed model is evaluated by ShapeNet and Multi-view Real-life voxel-based datasets. Experimental results show that 3D-C2FT achieves notable results and outperforms several competing models on these datasets.
ISSN
0302-9743
URI
https://pubs.kist.re.kr/handle/201004/76518
DOI
10.1007/978-3-031-26319-4_13
Appears in Collections:
KIST Conference Paper > 2022
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML

qrcode

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

BROWSE