Enhancing Osteoporosis Detection: An Explainable Multi-Modal Learning Framework with Feature Fusion and Variable Clustering

📅 2024-11-01
🏛️ arXiv.org
📈 Citations: 3
Influential: 0
📄 PDF
🤖 AI Summary
Early osteoporosis diagnosis is critical for preventing geriatric fractures, yet hindered by scarce labeled data and challenges in fusing heterogeneous multimodal data. This paper proposes a clinically grounded, interpretable dual-path multimodal learning framework: one pathway extracts features from X-ray images using VGG19, InceptionV3, or ResNet50; the other encodes standardized clinical variables. Both pathways undergo PCA-based dimensionality reduction, followed by a K-means–guided representative feature selection mechanism and end-to-end classification via fully connected networks. SHAP analysis identifies BMI, prior medical history, and height as the most discriminative clinical factors. Experiments demonstrate that clinical features dominate predictive performance—contributing significantly more than imaging features—while simultaneously enhancing both accuracy and interpretability. The framework delivers a trustworthy, deployable AI solution for primary-care osteoporosis screening.

Technology Category

Application Category

📝 Abstract
Osteoporosis is a common condition that increases fracture risk, especially in older adults. Early diagnosis is vital for preventing fractures, reducing treatment costs, and preserving mobility. However, healthcare providers face challenges like limited labeled data and difficulties in processing medical images. This study presents a novel multi-modal learning framework that integrates clinical and imaging data to improve diagnostic accuracy and model interpretability. The model utilizes three pre-trained networks-VGG19, InceptionV3, and ResNet50-to extract deep features from X-ray images. These features are transformed using PCA to reduce dimensionality and focus on the most relevant components. A clustering-based selection process identifies the most representative components, which are then combined with preprocessed clinical data and processed through a fully connected network (FCN) for final classification. A feature importance plot highlights key variables, showing that Medical History, BMI, and Height were the main contributors, emphasizing the significance of patient-specific data. While imaging features were valuable, they had lower importance, indicating that clinical data are crucial for accurate predictions. This framework promotes precise and interpretable predictions, enhancing transparency and building trust in AI-driven diagnoses for clinical integration.
Problem

Research questions and friction points this paper is trying to address.

Improving osteoporosis detection accuracy using multi-modal data fusion
Addressing limited labeled data and medical image processing challenges
Enhancing model interpretability for clinical trust and integration
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-modal learning integrates clinical and imaging data
PCA reduces dimensionality of deep features from X-ray images
Clustering-based selection identifies representative components for classification
🔎 Similar Papers
No similar papers found.
M
Mehdi Hosseini Chagahi
School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran
Saeed Mohammadi Dashtaki
Saeed Mohammadi Dashtaki
School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran
N
Niloufar Delfan
School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran
N
Nadia Mohammadi
Department of Epidemiology, Shiraz University of Medical Science, Shiraz, Iran
A
Alireza Samari
School of Mechanical Engineering, College of Engineering, University of Tehran, Tehran, Iran
Behzad Moshiri
Behzad Moshiri
Professor of School of ECE, Univ. of Tehran, Iran & Adjunct Professor of Univ. of Waterloo, Canada.
Advanced Process ControlSensor/Data FusionMechatronicsIndustrial AutomationIntelligent
M
Md. Jalil Piran
Department of Computer Science and Engineering, Sejong University, Seoul 05006, South Korea
Oliver Faust
Oliver Faust
School of Computing and Information Science, Anglia Ruskin University, East Road, Cambridge, UK