Enhancing Fundus Image-based Glaucoma Screening via Dynamic Global-Local Feature Integration

📅 2025-04-01
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the challenges of high variability in fundus image quality, poor cross-device and cross-ethnic generalizability, and ambiguous cup-to-disc boundary delineation—leading to insufficient classification robustness in glaucoma diagnosis—this paper proposes a dynamic global-local feature fusion framework. Our method introduces two key innovations: (1) an adaptive attention window mechanism that automatically identifies optimal anatomical regions for optic cup and disc localization; and (2) a multi-head attention-driven linear readout strategy that explicitly models structural consistency (global) and local discriminability (local). Evaluated on a large-scale, real-world clinical dataset encompassing multiple centers, imaging devices, and ethnic groups, our approach achieves significant improvements in classification accuracy and cross-domain generalization, consistently outperforming state-of-the-art methods in robustness and reliability.

Technology Category

Application Category

📝 Abstract
With the advancements in medical artificial intelligence (AI), fundus image classifiers are increasingly being applied to assist in ophthalmic diagnosis. While existing classification models have achieved high accuracy on specific fundus datasets, they struggle to address real-world challenges such as variations in image quality across different imaging devices, discrepancies between training and testing images across different racial groups, and the uncertain boundaries due to the characteristics of glaucomatous cases. In this study, we aim to address the above challenges posed by image variations by highlighting the importance of incorporating comprehensive fundus image information, including the optic cup (OC) and optic disc (OD) regions, and other key image patches. Specifically, we propose a self-adaptive attention window that autonomously determines optimal boundaries for enhanced feature extraction. Additionally, we introduce a multi-head attention mechanism to effectively fuse global and local features via feature linear readout, improving the model's discriminative capability. Experimental results demonstrate that our method achieves superior accuracy and robustness in glaucoma classification.
Problem

Research questions and friction points this paper is trying to address.

Addressing image quality variations across different devices
Resolving discrepancies in training and testing images among races
Improving uncertain boundaries in glaucomatous case detection
Innovation

Methods, ideas, or system contributions that make the work stand out.

Self-adaptive attention window for optimal boundaries
Multi-head attention mechanism for feature fusion
Dynamic global-local feature integration for robustness
🔎 Similar Papers
No similar papers found.
Y
Yuzhuo Zhou
Minzu University Of China, China
C
Chi Liu
Faculty of Data Science, City University of Macau, Macao SAR, China
Sheng Shen
Sheng Shen
Facebook Reality Labs
Mobile ComputingWireless SensingInternet of Things (IoT)AR/VRLocalization
S
Siyu Le
Minzu University Of China, China
L
Liwen Yu
Minzu University Of China, China
S
Sihan Ouyang
Minzu University Of China, China
Z
Zongyuan Ge
Faculty of Information Technology, Monash University, Melbourne, VIC, Australia