Shortcut Learning in Glomerular AI: Adversarial Penalties Hurt, Entropy Helps

📅 2026-04-09

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

This study addresses the challenge of distribution shift and shortcut learning in renal pathology AI caused by staining variations, particularly in real-world settings where staining or institutional labels are unavailable for correction. To tackle this, the authors construct a multi-center, multi-stain glomerular image dataset and propose a Bayesian dual-head architecture that operates without staining labels. The framework integrates CNN and ViT backbones, Monte Carlo Dropout, and entropy maximization regularization to suppress reliance on spurious staining correlations while avoiding the increased uncertainty often introduced by adversarial training. Experiments demonstrate that the method maintains high accuracy and calibration in lesion classification while reducing staining prediction performance to random levels, thereby confirming its robustness against staining shortcuts and highlighting the intrinsic generalization benefits of the multi-stain dataset.

Technology Category

Application Category

📝 Abstract

Stain variability is a pervasive source of distribution shift and potential shortcut learning in renal pathology AI. We ask whether lupus nephritis glomerular lesion classifiers exploit stain as a shortcut, and how to mitigate such bias without stain or site labels. We curate a multi-center, multi-stain dataset of 9{,}674 glomerular patches (224$\times$224) from 365 WSIs across three centers and four stains (PAS, H\&E, Jones, Trichrome), labeled as proliferative vs.\ non-proliferative. We evaluate Bayesian CNN and ViT backbones with Monte Carlo dropout in three settings: (1) stain-only classification; (2) a dual-head model jointly predicting lesion and stain with supervised stain loss; and (3) a dual-head model with label-free stain regularization via entropy maximization on the stain head. In (1), stain identity is trivially learnable, confirming a strong candidate shortcut. In (2), varying the strength and sign of stain supervision strongly modulates stain performance but leaves lesion metrics essentially unchanged, indicating no measurable stain-driven shortcut learning on this multi-stain, multi-center dataset, while overly adversarial stain penalties inflate predictive uncertainty. In (3), entropy-based regularization holds stain predictions near chance without degrading lesion accuracy or calibration. Overall, a carefully curated multi-stain dataset can be inherently robust to stain shortcuts, and a Bayesian dual-head architecture with label-free entropy regularization offers a simple, deployment-friendly safeguard against potential stain-related drift in glomerular AI.

Problem

Research questions and friction points this paper is trying to address.

shortcut learning

stain variability

distribution shift

glomerular AI

bias mitigation

Innovation

Methods, ideas, or system contributions that make the work stand out.

shortcut learning

entropy regularization

multi-stain dataset

Bayesian dual-head architecture

stain invariance

🔎 Similar Papers

No similar papers found.

Authors to Follow