🤖 AI Summary
This study addresses the high cost and limited accessibility of MRI in osteoarthritis diagnosis by proposing a conditional diffusion model (DDPM)-based cross-modal synthesis method that generates knee MRI volumes from X-ray images. To enhance clinical relevance, the approach explicitly integrates X-ray data with patient-specific clinical metadata—including age, BMI, and joint space width—via a radiological feature encoder and a metadata embedding module. A progressive denoising scheme coupled with multi-scale reconstruction ensures temporal coherence and anatomical plausibility across the synthesized MRI sequence. Quantitative and qualitative evaluations demonstrate that the generated MRIs achieve superior visual quality and soft-tissue fidelity compared to baseline methods. Ablation studies confirm that incorporating patient metadata improves Dice similarity coefficient by 12.3% and increases inter-rater agreement among clinical experts by 27%, underscoring the method’s clinical validity and practical utility.
📝 Abstract
Knee osteoarthritis (KOA) is a prevalent musculoskeletal disorder, often diagnosed using X-rays due to its cost-effectiveness. While Magnetic Resonance Imaging (MRI) provides superior soft tissue visualization and serves as a valuable supplementary diagnostic tool, its high cost and limited accessibility significantly restrict its widespread use. To explore the feasibility of bridging this imaging gap, we conducted a feasibility study leveraging a diffusion-based model that uses an X-ray image as conditional input, alongside target depth and additional patient-specific feature information, to generate corresponding MRI sequences. Our findings demonstrate that the MRI volumes generated by our approach is visually closer to real MRI scans. Moreover, increasing inference steps enhances the continuity and smoothness of the synthesized MRI sequences. Through ablation studies, we further validate that integrating supplementary patient-specific information, beyond what X-rays alone can provide, enhances the accuracy and clinical relevance of the generated MRI, which underscores the potential of leveraging external patient-specific information to improve the MRI generation. This study is available at https://zwang78.github.io/.