DSpace at KIST: Cross-Modality Image Registration Via Generating Aligned Image Using Reference-Augmented Framework

Cross-Modality Image Registration Via Generating Aligned Image Using Reference-Augmented Framework

Authors: Kim, Daniel; Shazly, Abdullah; Al-masni, Mohammed A.; Kim, Dong-Hyun; Ryu, Kanghyun

Issue Date: 2025-12

Publisher: Institute of Electrical and Electronics Engineers Inc.

Citation: IEEE Journal of Biomedical and Health Informatics, pp.1 - 14

Abstract: Aligning a pair of cross-modality images (e.g., MR-CT, CBCT-CT) is important, yet conventional approaches, including registration and Image-to-Image (I2I) translation methods, often have limitations. To overcome these challenges, we introduce a “Register by Generation (RbG)” framework, a novel 2D deep learning approach designed to generate images that are structurally well-aligned with the fixed image while preserving the detailed intensity and contrast of the moving image (referred to here as the reference image). Our approach operates in two sequential stages. In the first stage, we employ a novel semi-global reference-augmented image synthesis network incorporating Patch Adaptive Instance Normalization (PAdaIN). This method leverages a down-sampled reference image to guide local adaptive synthesis, generating a more accurately aligned image with a reduced risk of hallucinations. In the second stage, we introduce a detail-refining reference-augmented network featuring a Deformation-Aware Cross-Attention (DACA) block, which aims to recover finer details and textures that may be missing after the first stage. The DACA block enables the transfer of corresponding features from the reference image, effectively performing a “copy-and-paste” operation within the latent feature space. Additionally, we propose a novel combination of loss functions that enables self-supervised training on misaligned datasets, eliminating the need for pre-aligned data. We rigorously evaluate our method on multiple misaligned datasets using metrics focused on structural alignment and distributional consistency, demonstrating superior performance on both. Furthermore, we test its robustness by simulating intentional misalignments in a well-aligned dataset. Finally, experiments from a case study and downstream segmentation tasks highlight the broad applicability of our approach.
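
The abstract does not spell out how PAdaIN is computed. As a rough illustration only, the sketch below applies standard adaptive instance normalization patch by patch: each patch of a content map is normalized and then rescaled to the mean and standard deviation of the co-located reference patch. The function name `patch_adain`, the fixed square patch grid, and the use of plain 2D lists are assumptions made for this example, not details taken from the paper.

```python
from statistics import fmean, pstdev


def patch_adain(content, reference, patch, eps=1e-5):
    """Illustrative patch-wise AdaIN on 2D lists (not the authors' code).

    Each patch of `content` is normalized to zero mean and unit variance,
    then rescaled with the statistics of the same patch in `reference`.
    """
    h, w = len(content), len(content[0])
    out = [[0.0] * w for _ in range(h)]
    for top in range(0, h, patch):
        for left in range(0, w, patch):
            # Patches at the border may be smaller than `patch`.
            rows = range(top, min(top + patch, h))
            cols = range(left, min(left + patch, w))
            c_vals = [content[r][c] for r in rows for c in cols]
            r_vals = [reference[r][c] for r in rows for c in cols]
            c_mu, c_sigma = fmean(c_vals), pstdev(c_vals)
            r_mu, r_sigma = fmean(r_vals), pstdev(r_vals)
            for r in rows:
                for c in cols:
                    normalized = (content[r][c] - c_mu) / (c_sigma + eps)
                    out[r][c] = r_sigma * normalized + r_mu
    return out
```

Under these assumptions, every output patch keeps the spatial pattern of the content patch while taking on the local intensity statistics of the reference, which is one plausible reading of "local adaptive synthesis" guided by a reference image.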

ISSN: 2168-2194

URI: https://pubs.kist.re.kr/handle/201004/153970

DOI: 10.1109/jbhi.2025.3642431

Appears in Collections: KIST Article > 2025
