APTOS-2024 challenge report: Generation of synthetic 3D OCT images from fundus photographs

📅 2025-06-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the limited deployment of optical coherence tomography (OCT) in resource-constrained settings—due to its high cost and requirement for specialized operation—this study proposes a novel paradigm for synthesizing high-fidelity 3D OCT volumes from low-cost 2D fundus color photographs. We introduce the first benchmark for cross-modal fundus-to-3D-OCT synthesis and establish a two-tiered evaluation framework: pixel-level B-scan similarity and semantic-level volumetric consistency, rigorously validating clinical feasibility. Methodologically, our approach integrates cross-modal collaborative preprocessing, external fundus image pretraining, vision foundation model incorporation, and a customized 3D generative network. In an international challenge involving 342 teams, the top-performing method significantly improved anatomical fidelity and volumetric consistency. This work provides a practical, deployable solution for tele-ophthalmology and AI-assisted diagnosis in underserved regions.

Technology Category

Application Category

📝 Abstract
Optical Coherence Tomography (OCT) provides high-resolution, 3D, and non-invasive visualization of retinal layers in vivo, serving as a critical tool for lesion localization and disease diagnosis. However, its widespread adoption is limited by equipment costs and the need for specialized operators. In comparison, 2D color fundus photography offers faster acquisition and greater accessibility with less dependence on expensive devices. Although generative artificial intelligence has demonstrated promising results in medical image synthesis, translating 2D fundus images into 3D OCT images presents unique challenges due to inherent differences in data dimensionality and biological information between modalities. To advance generative models in the fundus-to-3D-OCT setting, the Asia Pacific Tele-Ophthalmology Society (APTOS-2024) organized a challenge titled Artificial Intelligence-based OCT Generation from Fundus Images. This paper details the challenge framework (referred to as APTOS-2024 Challenge), including: the benchmark dataset, evaluation methodology featuring two fidelity metrics-image-based distance (pixel-level OCT B-scan similarity) and video-based distance (semantic-level volumetric consistency), and analysis of top-performing solutions. The challenge attracted 342 participating teams, with 42 preliminary submissions and 9 finalists. Leading methodologies incorporated innovations in hybrid data preprocessing or augmentation (cross-modality collaborative paradigms), pre-training on external ophthalmic imaging datasets, integration of vision foundation models, and model architecture improvement. The APTOS-2024 Challenge is the first benchmark demonstrating the feasibility of fundus-to-3D-OCT synthesis as a potential solution for improving ophthalmic care accessibility in under-resourced healthcare settings, while helping to expedite medical research and clinical applications.
Problem

Research questions and friction points this paper is trying to address.

Generating 3D OCT images from 2D fundus photos
Overcoming cost and accessibility of OCT imaging
Addressing dimensionality gaps in cross-modality synthesis
Innovation

Methods, ideas, or system contributions that make the work stand out.

Generative AI for 2D fundus to 3D OCT synthesis
Hybrid data preprocessing and augmentation techniques
Integration of vision foundation models
🔎 Similar Papers
No similar papers found.
Bowen Liu
Bowen Liu
Andreessen Horowitz, insitro, Stanford
Computational ChemistryDrug DiscoveryGraph Machine Learning
W
Weiyi Zhang
School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong; Research Centre for SHARP Vision (RCSV), The Hong Kong Polytechnic University, Kowloon, Hong Kong
P
Peranut Chotcomwongse
Department of Ophthalmology, College of Medicine, Rangsit University, Rajavithi Hospital, Thailand
X
Xiaolan Chen
School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong; Research Centre for SHARP Vision (RCSV), The Hong Kong Polytechnic University, Kowloon, Hong Kong
Ruoyu Chen
Ruoyu Chen
Institute of Information Engineering, Chinese Academy of Sciences.
Explainable AITrustworthy AIFoundation Model
P
Pawin Pakaymaskul
Department of Ophthalmology, College of Medicine, Rangsit University, Rajavithi Hospital, Thailand
N
Niracha Arjkongharn
Department of Ophthalmology, College of Medicine, Rangsit University, Rajavithi Hospital, Thailand
N
Nattaporn Vongsa
Department of Ophthalmology, College of Medicine, Rangsit University, Rajavithi Hospital, Thailand
Xuelian Cheng
Xuelian Cheng
Monash University
3D VisionMedical ImagingMachine Learning
Z
Zongyuan Ge
eResearch Center and Faculty of Engineering, Monash University, Clayton, VIC, 3800 Australia
K
Kun Huang
Nanjing University of Science and Technology, Nanjing 210000, Jiangsu, China
X
Xiaohui Li
Nanjing University of Science and Technology, Nanjing 210000, Jiangsu, China
Y
Yiru Duan
Nanjing University of Science and Technology, Nanjing 210000, Jiangsu, China
Z
Zhenbang Wang
Nanjing University of Science and Technology, Nanjing 210000, Jiangsu, China
B
BaoYe Xie
Nanjing University of Science and Technology, Nanjing 210000, Jiangsu, China
Q
Qiang Chen
Nanjing University of Science and Technology, Nanjing 210000, Jiangsu, China
Huazhu Fu
Huazhu Fu
Principal Scientist, IHPC, A*STAR
Medical Image AnalysisAI for HealthcareMedical AITrustworthy AI
M
M. A. Mahr
Mayo Clinic Department of Ophthalmology, Rochester, MN USA
J
Jiaqi Qu
China Medical University, Shenyang 110122, Liaoning, China
W
Wangyiyang Chen
Northeastern University, Shenyang 110819, Liaoning, China
S
Shiye Wang
Henan Kaifeng College of Science Technology and Communication, Kaifeng 475001, China
Y
Yubo Tan
University of Electronic Science and Technology of China, Chengdu 610054, China
Yongjie Li
Yongjie Li
University of Electronic Science and Technology of China
Biological vision modelingComputational vision and image processing
Mingguang He
Mingguang He
The Hong Kong Polytechnic University
Ophthalmoogy
D
Danli Shi
School of Optometry, The Hong Kong Polytechnic University, Kowloon, Hong Kong; Research Centre for SHARP Vision (RCSV), The Hong Kong Polytechnic University, Kowloon, Hong Kong
P
Paisan Ruamviboonsuk
Department of Ophthalmology, College of Medicine, Rangsit University, Rajavithi Hospital, Thailand