Full end-to-end diagnostic workflow automation of 3D OCT via foundation model-driven AI for retinal diseases

πŸ“… 2026-02-03
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the limitations of existing fully automated diagnostic approaches for 3D optical coherence tomography (OCT) imaging, which are hindered by multi-stage pipelines and single-slice, single-task models that impede efficient end-to-end analysis. The authors propose FOCUS, the first foundation-model-based framework enabling fully automated, end-to-end 3D OCT diagnosis. FOCUS integrates image quality assessment, anomaly detection, and multi-disease classification within a unified architecture, and introduces a novel adaptive aggregation strategy to effectively fuse 2D slice-level predictions into patient-level diagnoses. On an internal test set, FOCUS achieves F1 scores of 99.01%, 97.46%, and 94.39% for quality assessment, anomaly detection, and patient-level diagnosis, respectively. External multicenter validation demonstrates robust performance with F1 scores ranging from 90.22% to 95.24%, matching expert-level accuracy while significantly improving diagnostic efficiency.

Technology Category

Application Category

πŸ“ Abstract
Optical coherence tomography (OCT) has revolutionized retinal disease diagnosis with its high-resolution and three-dimensional imaging nature, yet its full diagnostic automation in clinical practices remains constrained by multi-stage workflows and conventional single-slice single-task AI models. We present Full-process OCT-based Clinical Utility System (FOCUS), a foundation model-driven framework enabling end-to-end automation of 3D OCT retinal disease diagnosis. FOCUS sequentially performs image quality assessment with EfficientNetV2-S, followed by abnormality detection and multi-disease classification using a fine-tuned Vision Foundation Model. Crucially, FOCUS leverages a unified adaptive aggregation method to intelligently integrate 2D slices-level predictions into comprehensive 3D patient-level diagnosis. Trained and tested on 3,300 patients (40,672 slices), and externally validated on 1,345 patients (18,498 slices) across four different-tier centers and diverse OCT devices, FOCUS achieved high F1 scores for quality assessment (99.01%), abnormally detection (97.46%), and patient-level diagnosis (94.39%). Real-world validation across centers also showed stable performance (F1: 90.22%-95.24%). In human-machine comparisons, FOCUS matched expert performance in abnormality detection (F1: 95.47% vs 90.91%) and multi-disease diagnosis (F1: 93.49% vs 91.35%), while demonstrating better efficiency. FOCUS automates the image-to-diagnosis pipeline, representing a critical advance towards unmanned ophthalmology with a validated blueprint for autonomous screening to enhance population scale retinal care accessibility and efficiency.
Problem

Research questions and friction points this paper is trying to address.

3D OCT
retinal diseases
diagnostic automation
end-to-end workflow
clinical AI
Innovation

Methods, ideas, or system contributions that make the work stand out.

foundation model
end-to-end automation
3D OCT
adaptive aggregation
retinal disease diagnosis
πŸ”Ž Similar Papers
No similar papers found.
J
Jinze Zhang
State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangzhou, China
J
Jian Zhong
Department of Electrical and Electronic Engineering, Southern University of Science and Technology, Shenzhen, China
L
Li Lin
Department of Electrical and Electronic Engineering, Southern University of Science and Technology, Shenzhen, China
J
Jiaxiong Li
State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangzhou, China
K
Ke Ma
Department of Health Technology and Informatics, The Hong Kong Polytechnic University, Hong Kong, China
N
Naiyang Li
Eye Center, Zhongshan City People's Hospital, Zhongshan, Guangdong, China
Meng Li
Meng Li
China University of Mining and Technology
Mining Engineering
Y
Yuan Pan
State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangzhou, China
Z
Zeyu Meng
State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangzhou, China
M
Meng-Sha Zhou
State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangzhou, China
S
Shang Huang
State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangzhou, China
S
Shilong Yu
State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangzhou, China
Z
Zhengyu Duan
State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangzhou, China
S
Sutong Li
Eye Center, Zhongshan City People's Hospital, Zhongshan, Guangdong, China
H
Honghui Xia
Department of Ophthalmology, Zhaoqing Gaoyao People’s Hospital, Zhaoqing, China
J
Juping Liu
Tianjin Key Laboratory of Retinal Functions and Diseases, Tianjin Branch of National Clinical Research Center for Ocular Disease, Eye Institute and School of Optometry, Tianjin Medical University Eye Hospital, Tianjin, China
D
Dan Liang
State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangzhou, China
Yantao Wei
Yantao Wei
Professor, Central China Normal University
Machine LearningComputer VisionTarget Detection
X
Xiaoying Tang
Department of Electrical and Electronic Engineering, Southern University of Science and Technology, Shenzhen, China
J
Jin Yuan
Beijing Tongren Eye Center, Beijing Tongren Hospital, Capital Medical University, Beijing Key Laboratory of Ophthalmology & Visual Sciences, Beijing, China
P
P. Xiao
State Key Laboratory of Ophthalmology, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Guangzhou, China