Rethinking Bias in Generative Data Augmentation for Medical AI: a Frequency Recalibration Method

📅 2025-11-15
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses performance degradation in medical generative data augmentation (GDA) caused by frequency-domain distribution mismatch—termed *frequency bias*—between real and synthetic images. We propose FreRec, a training-free post-processing framework that identifies frequency misalignment as the primary cause of synthetic image quality degradation. FreRec introduces a two-stage frequency-domain calibration strategy: Statistical High-frequency Replacement (SHR) and Reconstructed High-frequency Mapping (RHM), which jointly recalibrate and reconstruct high-frequency components in synthetic images. The method is model-agnostic and seamlessly integrates into existing GDA pipelines. Extensive evaluation across diverse medical imaging modalities—including brain MRI, chest X-ray, and fundus photography—demonstrates that FreRec consistently improves classification accuracy and effectively mitigates performance collapse under low-data regimes. By bridging the frequency-domain gap between real and synthetic data, FreRec establishes a novel paradigm for trustworthy medical AI data augmentation.

Technology Category

Application Category

📝 Abstract
Developing Medical AI relies on large datasets and easily suffers from data scarcity. Generative data augmentation (GDA) using AI generative models offers a solution to synthesize realistic medical images. However, the bias in GDA is often underestimated in medical domains, with concerns about the risk of introducing detrimental features generated by AI and harming downstream tasks. This paper identifies the frequency misalignment between real and synthesized images as one of the key factors underlying unreliable GDA and proposes the Frequency Recalibration (FreRec) method to reduce the frequency distributional discrepancy and thus improve GDA. FreRec involves (1) Statistical High-frequency Replacement (SHR) to roughly align high-frequency components and (2) Reconstructive High-frequency Mapping (RHM) to enhance image quality and reconstruct high-frequency details. Extensive experiments were conducted in various medical datasets, including brain MRIs, chest X-rays, and fundus images. The results show that FreRec significantly improves downstream medical image classification performance compared to uncalibrated AI-synthesized samples. FreRec is a standalone post-processing step that is compatible with any generative model and can integrate seamlessly with common medical GDA pipelines.
Problem

Research questions and friction points this paper is trying to address.

Addressing frequency misalignment between real and synthetic medical images
Reducing harmful biases in generative data augmentation for medical AI
Improving downstream medical classification through frequency distribution calibration
Innovation

Methods, ideas, or system contributions that make the work stand out.

Frequency Recalibration method reduces distributional discrepancy
Statistical High-frequency Replacement aligns frequency components
Reconstructive High-frequency Mapping enhances image details
🔎 Similar Papers
No similar papers found.
C
Chi Liu
Faculty of Data Science, City University of Macau, Macao SAR, China
J
Jincheng Liu
Faculty of Data Science, City University of Macau, Macao SAR, China
Congcong Zhu
Congcong Zhu
USTC
Multimedia Understanding
M
Minghao Wang
Faculty of Data Science, City University of Macau, Macao SAR, China
Sheng Shen
Sheng Shen
Facebook Reality Labs
Mobile ComputingWireless SensingInternet of Things (IoT)AR/VRLocalization
J
Jia Gu
Faculty of Data Science, City University of Macau, Macao SAR, China
Tianqing Zhu
Tianqing Zhu
City University of Macau
PrivacyCyber SecurityMachine LearningAI Security
Wanlei Zhou
Wanlei Zhou
Professor, City University of Macau, Macao
Parallel and Distributed SystemsIT SecuritySecurity and PrivacyCyber Securitye-learning