Diffusion Model-based Data Augmentation Method for Fetal Head Ultrasound Segmentation

📅 2025-06-30
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the scarcity, high annotation cost, and privacy constraints limiting labeled fetal cranial ultrasound data, this paper proposes a mask-guided diffusion model that jointly synthesizes high-fidelity ultrasound images and their corresponding segmentation masks—enabling paired synthetic data augmentation. Crucially, the method embeds segmentation priors directly into the diffusion process, substantially improving the anatomical plausibility and task relevance of generated samples. These synthetic data are subsequently employed to supervise fine-tuning of the Segment Anything Model (SAM). Evaluated on two real-world low-resource cohorts—Spanish and African—the approach achieves state-of-the-art Dice scores of 94.66% and 94.38%, respectively. Results demonstrate both the efficacy and strong cross-domain generalizability of synthetic data for medical image segmentation under severe data scarcity.

Technology Category

Application Category

📝 Abstract
Medical image data is less accessible than in other domains due to privacy and regulatory constraints. In addition, labeling requires costly, time-intensive manual image annotation by clinical experts. To overcome these challenges, synthetic medical data generation offers a promising solution. Generative AI (GenAI), employing generative deep learning models, has proven effective at producing realistic synthetic images. This study proposes a novel mask-guided GenAI approach using diffusion models to generate synthetic fetal head ultrasound images paired with segmentation masks. These synthetic pairs augment real datasets for supervised fine-tuning of the Segment Anything Model (SAM). Our results show that the synthetic data captures real image features effectively, and this approach reaches state-of-the-art fetal head segmentation, especially when trained with a limited number of real image-mask pairs. In particular, the segmentation reaches Dice Scores of 94.66% and 94.38% using a handful of ultrasound images from the Spanish and African cohorts, respectively. Our code, models, and data are available on GitHub.
Problem

Research questions and friction points this paper is trying to address.

Limited access to medical image data due to privacy constraints
High cost and time for manual annotation by experts
Need for synthetic data to improve fetal ultrasound segmentation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Diffusion models generate synthetic ultrasound images
Mask-guided approach pairs images with segmentation
Augments real data for SAM fine-tuning
🔎 Similar Papers
No similar papers found.
Fangyijie Wang
Fangyijie Wang
School of Medicine, University College Dublin
Computer VisionMedical ImagingDeep Learning
K
Kevin Whelan
School of Computer Science, University College Dublin, Dublin, Ireland
F
Félix Balado
School of Computer Science, University College Dublin, Dublin, Ireland
G
Guénolé Silvestre
Research Ireland Centre for Research Training in Machine Learning; School of Computer Science, University College Dublin, Dublin, Ireland
K
Kathleen M. Curran
Research Ireland Centre for Research Training in Machine Learning; School of Medicine, University College Dublin, Dublin, Ireland