Tetrahedron-Net for Medical Image Registration

πŸ“… 2025-05-07
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
To address insufficient feature representation in medical image registration, this paper proposes a β€œtetrahedral” topology network architecture featuring a single encoder and dual decoders. The design enhances deformation modeling capability through dual-path decoder interaction and cross-scale feature fusion, departing from conventional U-shaped single-decoder paradigms. It supports three scalable instantiation variants, enabling plug-and-play performance improvements for mainstream models such as VoxelMorph and ViT-V-Net. Evaluated on LPBA40, OASIS, and MindBoggle101 benchmarks, the method achieves state-of-the-art results: average Dice score improves by 1.2–2.8%, Jacobian determinant abnormality rate decreases by 37%, and favorable trade-offs between generalizability and computational efficiency are maintained.

Technology Category

Application Category

πŸ“ Abstract
Medical image registration plays a vital role in medical image processing. Extracting expressive representations for medical images is crucial for improving the registration quality. One common practice for this end is constructing a convolutional backbone to enable interactions with skip connections among feature extraction layers. The de facto structure, U-Net-like networks, has attempted to design skip connections such as nested or full-scale ones to connect one single encoder and one single decoder to improve its representation capacity. Despite being effective, it still does not fully explore interactions with a single encoder and decoder architectures. In this paper, we embrace this observation and introduce a simple yet effective alternative strategy to enhance the representations for registrations by appending one additional decoder. The new decoder is designed to interact with both the original encoder and decoder. In this way, it not only reuses feature presentation from corresponding layers in the encoder but also interacts with the original decoder to corporately give more accurate registration results. The new architecture is concise yet generalized, with only one encoder and two decoders forming a ``Tetrahedron'' structure, thereby dubbed Tetrahedron-Net. Three instantiations of Tetrahedron-Net are further constructed regarding the different structures of the appended decoder. Our extensive experiments prove that superior performance can be obtained on several representative benchmarks of medical image registration. Finally, such a ``Tetrahedron'' design can also be easily integrated into popular U-Net-like architectures including VoxelMorph, ViT-V-Net, and TransMorph, leading to consistent performance gains.
Problem

Research questions and friction points this paper is trying to address.

Enhancing medical image registration representation quality
Exploring interactions beyond single encoder-decoder architectures
Integrating Tetrahedron design into U-Net-like networks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Adds second decoder to enhance feature interactions
Forms Tetrahedron structure with one encoder
Integrates easily into U-Net-like architectures
πŸ”Ž Similar Papers
2024-08-01Workshop on Biomedical Image RegistrationCitations: 2
Jinhai Xiang
Jinhai Xiang
College of Informatics, Huazhong Agricultural University
computer VisionMachine Learning
S
Shuai Guo
Huazhong Agricultural University, Wuhan, 430070, China
Q
Qianru Han
Huazhong Agricultural University, Wuhan, 430070, China
D
Dantong Shi
Huazhong Agricultural University, Wuhan, 430070, China
X
Xinwei He
Huazhong Agricultural University, Wuhan, 430070, China
Xiang Bai
Xiang Bai
Huazhong University of Science and Technology (HUST)
Computer VisionOCR