Shortcut Learning in Glomerular AI: Adversarial Penalties Hurt, Entropy Helps

📅 2026-04-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses the challenge of distribution shift and shortcut learning in renal pathology AI caused by staining variations, particularly in real-world settings where staining or institutional labels are unavailable for correction. To tackle this, the authors construct a multi-center, multi-stain glomerular image dataset and propose a Bayesian dual-head architecture that operates without staining labels. The framework integrates CNN and ViT backbones, Monte Carlo Dropout, and entropy maximization regularization to suppress reliance on spurious staining correlations while avoiding the increased uncertainty often introduced by adversarial training. Experiments demonstrate that the method maintains high accuracy and calibration in lesion classification while reducing staining prediction performance to random levels, thereby confirming its robustness against staining shortcuts and highlighting the intrinsic generalization benefits of the multi-stain dataset.

Technology Category

Application Category

📝 Abstract
Stain variability is a pervasive source of distribution shift and potential shortcut learning in renal pathology AI. We ask whether lupus nephritis glomerular lesion classifiers exploit stain as a shortcut, and how to mitigate such bias without stain or site labels. We curate a multi-center, multi-stain dataset of 9{,}674 glomerular patches (224$\times$224) from 365 WSIs across three centers and four stains (PAS, H\&E, Jones, Trichrome), labeled as proliferative vs.\ non-proliferative. We evaluate Bayesian CNN and ViT backbones with Monte Carlo dropout in three settings: (1) stain-only classification; (2) a dual-head model jointly predicting lesion and stain with supervised stain loss; and (3) a dual-head model with label-free stain regularization via entropy maximization on the stain head. In (1), stain identity is trivially learnable, confirming a strong candidate shortcut. In (2), varying the strength and sign of stain supervision strongly modulates stain performance but leaves lesion metrics essentially unchanged, indicating no measurable stain-driven shortcut learning on this multi-stain, multi-center dataset, while overly adversarial stain penalties inflate predictive uncertainty. In (3), entropy-based regularization holds stain predictions near chance without degrading lesion accuracy or calibration. Overall, a carefully curated multi-stain dataset can be inherently robust to stain shortcuts, and a Bayesian dual-head architecture with label-free entropy regularization offers a simple, deployment-friendly safeguard against potential stain-related drift in glomerular AI.
Problem

Research questions and friction points this paper is trying to address.

shortcut learning
stain variability
distribution shift
glomerular AI
bias mitigation
Innovation

Methods, ideas, or system contributions that make the work stand out.

shortcut learning
entropy regularization
multi-stain dataset
Bayesian dual-head architecture
stain invariance
🔎 Similar Papers
No similar papers found.
M
Mohammad Daouk
University of Houston, Houston, TX, USA
J
Jan Ulrich Becker
University Hospital Cologne, Cologne, Germany
N
Neeraja Kambham
Stanford University, Stanford, CA, USA
A
Anthony Chang
The University of Chicago, Chicago, IL, USA
Hien Nguyen
Hien Nguyen
PhD Student
C
Chandra Mohan
University of Houston, Houston, TX, USA