Temporal-Enhanced Interpretable Multi-Modal Prognosis and Risk Stratification Framework for Diabetic Retinopathy (TIMM-ProRS)

📅 2026-01-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the challenge of misdiagnosing diabetic retinopathy in resource-limited settings, where its symptoms significantly overlap with those of other ocular diseases. To this end, the authors propose a novel multimodal deep learning framework that uniquely integrates Vision Transformers, convolutional neural networks (CNNs), and graph neural networks (GNNs) to jointly analyze static retinal images and dynamic time-series biomarkers—such as HbA1c levels and retinal thickness—enabling temporally enhanced, interpretable risk stratification and prognosis prediction. Evaluated across multiple international datasets, the method achieves a classification accuracy of 97.8% and an F1 score of 0.96, substantially outperforming existing models including RSG-Net and DeepDR. This approach establishes a new paradigm for early, precise diagnosis and management of diabetic retinopathy.

Technology Category

Application Category

📝 Abstract
Diabetic retinopathy (DR), affecting millions globally with projections indicating a significant rise, poses a severe blindness risk and strains healthcare systems. Diagnostic complexity arises from visual symptom overlap with conditions like age-related macular degeneration and hypertensive retinopathy, exacerbated by high misdiagnosis rates in underserved regions. This study introduces TIMM-ProRS, a novel deep learning framework integrating Vision Transformer (ViT), Convolutional Neural Network (CNN), and Graph Neural Network (GNN) with multi-modal fusion. TIMM-ProRS uniquely leverages both retinal images and temporal biomarkers (HbA1c, retinal thickness) to capture multi-modal and temporal dynamics. Evaluated comprehensively across diverse datasets including APTOS 2019 (trained), Messidor-2, RFMiD, EyePACS, and Messidor-1 (validated), the model achieves 97.8\% accuracy and an F1-score of 0.96, demonstrating state-of-the-art performance and outperforming existing methods like RSG-Net and DeepDR. This approach enables early, precise, and interpretable diagnosis, supporting scalable telemedical management and enhancing global eye health sustainability.
Problem

Research questions and friction points this paper is trying to address.

Diabetic Retinopathy
Misdiagnosis
Multi-Modal Diagnosis
Temporal Biomarkers
Risk Stratification
Innovation

Methods, ideas, or system contributions that make the work stand out.

multi-modal fusion
temporal biomarkers
Vision Transformer
Graph Neural Network
interpretable prognosis
🔎 Similar Papers
Susmita Kar
Susmita Kar
Assistant Professor, Scottish Church College
Quantum ChemistryBiology
A
A S M Ahsanul Sarkar Akib
Department of Robotics, Robo Tech Valley, Dhaka, Bangladesh
A
Abdul Hasib
Department of Internet of Things and Robotics Engineering, University of Frontier Technology, Bangladesh
Samin Yaser
Samin Yaser
Graduate Research Assistant, University of Central Florida
Explainable AISequential Decision MakingMachine LearningGraph LearningBiomedical AI
A
Anas Bin Azim
Department of Robotics, Robo Tech Valley, Dhaka, Bangladesh