Prototype-Based Multiple Instance Learning for Gigapixel Whole Slide Image Classification

📅 2025-03-11

📈 Citations: 0

✨ Influential: 0

career value

176K/year

🤖 AI Summary

Current multiple instance learning (MIL) models for whole-slide image (WSI) analysis rely heavily on attention mechanisms, yielding weak interpretability and non-editable predictions—failing to meet clinical requirements for trustworthiness. To address this, we propose ProtoMIL, the first framework that deeply integrates prototype learning with MIL for interpretable WSI analysis. ProtoMIL employs a sparse autoencoder to automatically discover semantically coherent visual prototypes; predictions are transparently formulated as linear combinations of these prototypes, enabling real-time clinician intervention and correction. This ensures the model is “right for the right reasons,” overcoming the black-box nature and immutability of attention-based explanations. ProtoMIL achieves state-of-the-art classification accuracy on two major pathology benchmarks while delivering concept-level, human-understandable interpretations. Empirical results demonstrate that manual removal of irrelevant prototypes significantly improves model robustness and diagnostic credibility.

Technology Category

Application Category

📝 Abstract

Multiple Instance Learning (MIL) methods have succeeded remarkably in histopathology whole slide image (WSI) analysis. However, most MIL models only offer attention-based explanations that do not faithfully capture the model's decision mechanism and do not allow human-model interaction. To address these limitations, we introduce ProtoMIL, an inherently interpretable MIL model for WSI analysis that offers user-friendly explanations and supports human intervention. Our approach employs a sparse autoencoder to discover human-interpretable concepts from the image feature space, which are then used to train ProtoMIL. The model represents predictions as linear combinations of concepts, making the decision process transparent. Furthermore, ProtoMIL allows users to perform model interventions by altering the input concepts. Experiments on two widely used pathology datasets demonstrate that ProtoMIL achieves a classification performance comparable to state-of-the-art MIL models while offering intuitively understandable explanations. Moreover, we demonstrate that our method can eliminate reliance on diagnostically irrelevant information via human intervention, guiding the model toward being right for the right reason. Code will be publicly available at https://github.com/ss-sun/ProtoMIL.

Problem

Research questions and friction points this paper is trying to address.

Develops interpretable MIL model for WSI analysis

Enables human intervention in model decision process

Improves classification transparency and diagnostic relevance

Innovation

Methods, ideas, or system contributions that make the work stand out.

Uses sparse autoencoder for interpretable concepts

Represents predictions via linear concept combinations

Allows human intervention to alter input concepts

🔎 Similar Papers

No similar papers found.