🤖 AI Summary
Addressing the challenge of simultaneously optimizing recall and prediction calibration under extreme class imbalance—particularly in medical diagnosis—this paper proposes AF-SMOTE, a novel data augmentation framework integrating an adversarial discriminator with a boundary utility model. Building upon SMOTE-generated minority-class samples, AF-SMOTE introduces a boundary-aware adversarial filtering mechanism, theoretically proven to monotonically optimize the F<sub>β</sub> score without compromising calibration. Leveraging assumptions of decision boundary smoothness and class-conditional density, it enhances the reliability of rare-case detection. Empirical evaluation on multiple benchmarks—including MIMIC-IV and fraud detection datasets—demonstrates that AF-SMOTE consistently outperforms mainstream methods (e.g., SMOTE, ADASYN) in recall, average precision, and calibration metrics (ECE and Brier score). Moreover, it exhibits strong generalization across diverse disease domains, underscoring its robustness and clinical applicability.
📝 Abstract
We study classification under extreme class imbalance where recall and calibration are both critical, for example in medical diagnosis scenarios. We propose AF-SMOTE, a mathematically motivated augmentation framework that first synthesizes minority points and then filters them by an adversarial discriminator and a boundary utility model. We prove that, under mild assumptions on the decision boundary smoothness and class-conditional densities, our filtering step monotonically improves a surrogate of F_beta (for beta >= 1) while not inflating Brier score. On MIMIC-IV proxy label prediction and canonical fraud detection benchmarks, AF-SMOTE attains higher recall and average precision than strong oversampling baselines (SMOTE, ADASYN, Borderline-SMOTE, SVM-SMOTE), and yields the best calibration. We further validate these gains across multiple additional datasets beyond MIMIC-IV. Our successful application of AF-SMOTE to a healthcare dataset using a proxy label demonstrates in a disease-agnostic way its practical value in clinical situations, where missing true positive cases in rare diseases can have severe consequences.