Conditional Diffusion for 3D CT Volume Reconstruction from 2D X-rays

📅 2026-03-27
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of efficiently reconstructing diagnostically valuable 3D CT volumes from routine clinical 2D X-ray images to reduce radiation exposure, cost, and improve healthcare accessibility. The authors propose AXON, a novel framework that achieves high-quality 3D CT reconstruction from real-world clinical X-ray data for the first time. AXON integrates a Brownian bridge diffusion model for global structure generation with ControlNet-guided local detail refinement, supporting dual-plane X-ray inputs and 3D super-resolution upsampling within a coarse-to-fine collaborative strategy. Evaluated on multiple public and external datasets, AXON improves PSNR and SSIM by 11.9% and 11.0%, respectively, substantially outperforming existing methods while demonstrating strong cross-center generalizability and clinical applicability.
📝 Abstract
Computed tomography (CT) provides rich 3D anatomical details but is often constrained by high radiation exposure, substantial costs, and limited availability. While standard chest X-rays are cost-effective and widely accessible, they only provide 2D projections with limited pathological information. Reconstructing 3D CT volumes from 2D X-rays offers a transformative solution to increase diagnostic accessibility, yet existing methods predominantly rely on synthetic X-ray projections, limiting clinical generalization. In this work, we propose AXON, a multi-stage diffusion-based framework that reconstructs high-fidelity 3D CT volumes directly from real X-rays. AXON employs a coarse-to-fine strategy, with a Brownian Bridge diffusion model-based initial stage for global structural synthesis, followed by a ControlNet-based refinement stage for local intensity optimization. It also supports bi-planar X-ray input to mitigate depth ambiguities inherent in 2D-to-3D reconstruction. A super-resolution network is integrated to upscale the generated volumes to achieve diagnostic-grade resolution. Evaluations on both public and external datasets demonstrate that AXON significantly outperforms state-of-the-art baselines, achieving a 11.9% improvement in PSNR and a 11.0% increase in SSIM with robust generalizability across disparate clinical distributions. Our code is available at https://github.com/ai-med/AXON.
Problem

Research questions and friction points this paper is trying to address.

3D CT reconstruction
2D X-ray
clinical generalization
depth ambiguity
diagnostic accessibility
Innovation

Methods, ideas, or system contributions that make the work stand out.

conditional diffusion
3D CT reconstruction
real X-ray input
Brownian Bridge diffusion
ControlNet refinement
🔎 Similar Papers
No similar papers found.
M
Martin Rath
Technical University of Munich (TUM), Germany; Munich Center for Machine Learning (MCML), Germany
M
Morteza Ghahremani
Technical University of Munich (TUM), Germany; Munich Center for Machine Learning (MCML), Germany
Yitong Li
Yitong Li
PhD candidate in Technical University of Munich
Computer VisionMachine LearningMedical ImagingArtificial Intelligence
A
Ashkan Taghipour
University of Western Australia (UWA), Australia
M
Marcus Makowski
Technical University of Munich (TUM), Germany
Christian Wachinger
Christian Wachinger
Technical University of Munich
AI in Medical ImagingGeometric Deep LearningCausal InferenceMulti-Modal Diagnostics