Dual encoding feature filtering generalized attention UNET for retinal vessel segmentation

📅 2025-06-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Retinal vessel segmentation suffers from limited training samples, severe class imbalance, and inadequate feature representation, leading to poor generalizability and high false-positive rates. To address these challenges, we propose a novel U-Net variant featuring three key innovations: (1) a dual-encoder architecture that separately models domain-invariant preprocessed features and raw-image semantics; (2) a feature filtering fusion module and an attention-guided feature reconstruction fusion module—replacing conventional skip connections to enhance multi-scale feature selectivity and contextual awareness; and (3) a customized data augmentation and resampling strategy to mitigate distribution shift and class imbalance. Evaluated on five benchmark datasets—DRIVE, CHASE_DB1, STARE, HRF, and IOSTAR—our method consistently outperforms baseline models and state-of-the-art approaches, demonstrating substantial improvements in cross-dataset generalization performance.

Technology Category

Application Category

📝 Abstract
Retinal blood vessel segmentation is crucial for diagnosing ocular and cardiovascular diseases. Although the introduction of U-Net in 2015 by Olaf Ronneberger significantly advanced this field, yet issues like limited training data, imbalance data distribution, and inadequate feature extraction persist, hindering both the segmentation performance and optimal model generalization. Addressing these critical issues, the DEFFA-Unet is proposed featuring an additional encoder to process domain-invariant pre-processed inputs, thereby improving both richer feature encoding and enhanced model generalization. A feature filtering fusion module is developed to ensure the precise feature filtering and robust hybrid feature fusion. In response to the task-specific need for higher precision where false positives are very costly, traditional skip connections are replaced with the attention-guided feature reconstructing fusion module. Additionally, innovative data augmentation and balancing methods are proposed to counter data scarcity and distribution imbalance, further boosting the robustness and generalization of the model. With a comprehensive suite of evaluation metrics, extensive validations on four benchmark datasets (DRIVE, CHASEDB1, STARE, and HRF) and an SLO dataset (IOSTAR), demonstrate the proposed method's superiority over both baseline and state-of-the-art models. Particularly the proposed method significantly outperforms the compared methods in cross-validation model generalization.
Problem

Research questions and friction points this paper is trying to address.

Improves retinal vessel segmentation accuracy and generalization
Addresses limited training data and imbalance distribution
Enhances feature extraction with dual encoding and filtering
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dual encoder for richer feature encoding
Feature filtering fusion module for precision
Attention-guided feature reconstructing fusion
🔎 Similar Papers
No similar papers found.
Md Tauhidul Islam
Md Tauhidul Islam
Assistant Professor, Stanford University
Deep learningMedical Image AnalysisGenomicsAI InterpretabilityUltrasound
D
Da-Wen Wu
Department of Ophthalmology, West China Hospital, Sichuan University, Chengdu 610041, China
Q
Qing-Qing Tang
Department of Ophthalmology, West China Hospital, Sichuan University, Chengdu 610041, China
K
Kai-Yang Zhao
West China School of Medicine, Sichuan University, Chengdu 610041, China
Y
Yin Teng
Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu 610064, China
Y
Yan-Fei Li
Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu 610064, China
W
Wen-Yi Shang
Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu 610064, China
J
Jing-Yu Liu
Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu 610064, China
H
Hai-Xian Zhang
Machine Intelligence Laboratory, College of Computer Science, Sichuan University, Chengdu 610064, China