Self-supervised Normality Learning and Divergence Vector-guided Model Merging for Zero-shot Congenital Heart Disease Detection in Fetal Ultrasound Videos

📅 2025-03-10
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Zero-shot detection of congenital heart disease (CHD) in fetal ultrasound videos poses significant challenges due to data scarcity and strict privacy constraints across clinical centers. Method: We propose a privacy-preserving, data-free federated framework wherein each center trains solely on local normal fetal echocardiographic videos. A sparse Tube-based self-supervised learning paradigm is introduced to extract healthy spatiotemporal representations. CHD detection is reformulated as “normality learning,” and a novel DivMerge strategy is proposed—leveraging divergence-guided vector aggregation to enable federated model fusion without sharing raw data or labels. The method integrates spatiotemporal feature disentanglement, self-distillation loss, and unsupervised anomaly scoring. Results: Evaluated on real-world data from five hospitals, the fused model achieves 23.77% higher accuracy and 30.13% higher F1-score on external test cohorts compared to single-center baselines, demonstrating strong zero-shot generalization while guaranteeing strict data privacy.

Technology Category

Application Category

📝 Abstract
Congenital Heart Disease (CHD) is one of the leading causes of fetal mortality, yet the scarcity of labeled CHD data and strict privacy regulations surrounding fetal ultrasound (US) imaging present significant challenges for the development of deep learning-based models for CHD detection. Centralised collection of large real-world datasets for rare conditions, such as CHD, from large populations requires significant co-ordination and resource. In addition, data governance rules increasingly prevent data sharing between sites. To address these challenges, we introduce, for the first time, a novel privacy-preserving, zero-shot CHD detection framework that formulates CHD detection as a normality modeling problem integrated with model merging. In our framework dubbed Sparse Tube Ultrasound Distillation (STUD), each hospital site first trains a sparse video tube-based self-supervised video anomaly detection (VAD) model on normal fetal heart US clips with self-distillation loss. This enables site-specific models to independently learn the distribution of healthy cases. To aggregate knowledge across the decentralized models while maintaining privacy, we propose a Divergence Vector-Guided Model Merging approach, DivMerge, that combines site-specific models into a single VAD model without data exchange. Our approach preserves domain-agnostic rich spatio-temporal representations, ensuring generalization to unseen CHD cases. We evaluated our approach on real-world fetal US data collected from 5 hospital sites. Our merged model outperformed site-specific models by 23.77% and 30.13% in accuracy and F1-score respectively on external test sets.
Problem

Research questions and friction points this paper is trying to address.

Detecting congenital heart disease in fetal ultrasound videos.
Overcoming data scarcity and privacy in CHD detection.
Merging decentralized models without sharing data.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Self-supervised normality learning for anomaly detection
Divergence Vector-Guided Model Merging (DivMerge)
Privacy-preserving zero-shot CHD detection framework
🔎 Similar Papers
No similar papers found.
Pramit Saha
Pramit Saha
Department of Engineering Science, University of Oxford
Deep LearningFederated LearningMultimodal LearningComputer VisionMedical Image Analysis
Divyanshu Mishra
Divyanshu Mishra
DPhil Student at University of Oxford
Video UnderstandingVideo SSLMedical Image AnalysisMulti-Modal LearningUltrasound
N
N. Hernandez-Cruz
Department of Engineering Science, University of Oxford
O
O. Patey
Nuffield Department of Women’s and Reproductive Health, University of Oxford
A
Aris T. Papageorghiou
Nuffield Department of Women’s and Reproductive Health, University of Oxford
Yuki M. Asano
Yuki M. Asano
Full Professor, Head of FunAI Lab, University of Technology Nuremberg
Deep LearningMultimodal LearningSelf-supervised LearningLarge Model AdaptationLLMs
J
J. A. Noble
Department of Engineering Science, University of Oxford