Navigate Beyond Shortcuts: Debiased Learning through the Lens of Neural Collapse

📅 2024-05-09
🏛️ Computer Vision and Pattern Recognition
📈 Citations: 7
Influential: 1
🤖 AI Summary
Problem: On attribute-imbalanced data, models often resort to shortcut learning, which yields biased, non-collapsed feature representations. Method: This paper proposes a debiasing framework grounded in neural collapse theory, introducing the neural collapse structure to shortcut-learning mitigation for the first time. It follows a “shortcut-avoidance” paradigm featuring a shortcut-prime guidance mechanism, symmetry constraints on the feature space, and end-to-end differentiable optimization, intervening from the start of training without additional parameters or computational cost. Contribution/Results: Theoretical analysis and experiments demonstrate that the method suppresses the early formation of biased features on both synthetic and real-world biased datasets, improves training stability, and achieves state-of-the-art generalization performance without additional training complexity.

📝 Abstract
Recent studies have noted an intriguing phenomenon termed Neural Collapse, that is, when the neural networks establish the right correlation between feature spaces and the training targets, their last-layer features, together with the classifier weights, will collapse into a stable and symmetric structure. In this paper, we extend the investigation of Neural Collapse to the biased datasets with imbalanced attributes. We observe that models will easily fall into the pitfall of shortcut learning and form a biased, non-collapsed feature space at the early period of training, which is hard to reverse and limits the generalization capability. To tackle the root cause of biased classification, we follow the recent inspiration of prime training, and propose an avoid-shortcut learning framework without additional training complexity. With well-designed shortcut primes based on Neural Collapse structure, the models are encouraged to skip the pursuit of simple shortcuts and naturally capture the intrinsic correlations. Experimental results demonstrate that our method induces better convergence properties during training, and achieves state-of-the-art generalization performance on both synthetic and real-world biased datasets.
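
The "stable and symmetric structure" mentioned in the abstract is usually described in the Neural Collapse literature as a simplex equiangular tight frame (ETF): unit-norm class vectors whose pairwise cosine similarity is -1/(K-1) for K classes. The abstract does not spell out this construction, so the following is only a minimal NumPy sketch of that standard geometry, not the authors' implementation; the function name `simplex_etf` and its parameters are hypothetical.

```python
import numpy as np


def simplex_etf(num_classes: int, feat_dim: int, seed: int = 0) -> np.ndarray:
    """Return a (feat_dim, num_classes) matrix whose columns form a simplex ETF.

    Hypothetical helper for illustration; assumes feat_dim >= num_classes >= 2.
    """
    if num_classes < 2 or feat_dim < num_classes:
        raise ValueError("need feat_dim >= num_classes >= 2")
    rng = np.random.default_rng(seed)
    # Reduced QR of a random matrix gives num_classes orthonormal columns.
    basis, _ = np.linalg.qr(rng.standard_normal((feat_dim, num_classes)))
    k = num_classes
    # Centering removes the shared mean direction; the scale restores unit norm.
    centering = np.eye(k) - np.ones((k, k)) / k
    return np.sqrt(k / (k - 1)) * basis @ centering


# Usage: inspect the Gram matrix of class vectors for K = 4 classes.
etf = simplex_etf(num_classes=4, feat_dim=16)
gram = etf.T @ etf  # diagonal entries are 1, off-diagonal entries are -1/(K-1)
```

In a collapsed model, the class means of the last-layer features and the classifier weight vectors would both align with such a frame; comparing a trained model's Gram matrix against this target is one common way to measure how far a feature space is from collapse.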
Problem

Research questions and friction points this paper is trying to address.

Addresses biased learning in imbalanced datasets
Prevents shortcut learning using Neural Collapse principles
Improves generalization without extra training complexity
Innovation

Methods, ideas, or system contributions that make the work stand out.

Neural Collapse structure primes
Avoid-shortcut learning framework
No additional training complexity
Yining Wang
School of Computer Science, Fudan University, China
Junjie Sun
Student of Computer Science, Fudan University
artificial intelligence
Chenyue Wang
School of Computer Science, Fudan University, China
Mi Zhang
School of Computer Science, Fudan University, China
Min Yang
Bytedance
Vision Language Model, Computer Vision, Video Understanding