🤖 AI Summary
Existing prototypical part networks (e.g., ProtoPNet) suffer from prototype redundancy and semantic overlap, yielding explanations that lack diversity and discriminability. To address this, the authors propose a non-parametric part-prototype learning framework: deep features from foundation vision models (e.g., ViT backbones) are clustered in an unsupervised, non-parametric fashion to automatically discover semantically distinct, diverse, and non-redundant part prototypes per class. Two new quantitative metrics, the Distinctiveness Score and the Comprehensiveness Score, are introduced to evaluate explanation quality. Classification is performed by matching image features against the learned part prototypes. On CUB-200-2011, Stanford Cars, and Stanford Dogs, the method compares favourably against existing ProtoPNets in classification accuracy while providing better interpretability. The code is publicly available.
📝 Abstract
Classifying images with an interpretable decision-making process is a long-standing problem in computer vision. In recent years, Prototypical Part Networks have gained traction as an approach to self-explainable neural networks, due to their ability to mimic human visual reasoning by providing explanations based on prototypical object parts. However, the quality of the explanations generated by these methods leaves room for improvement, as the prototypes usually focus on repetitive and redundant concepts. Leveraging recent advances in prototype learning, we present a framework for part-based interpretable image classification that learns a set of semantically distinctive object parts for each class and provides diverse and comprehensive explanations. The core of our method is to learn the part prototypes in a non-parametric fashion, by clustering deep features extracted from foundation vision models that encode robust semantic information. To quantitatively evaluate the quality of explanations provided by ProtoPNets, we introduce the Distinctiveness Score and the Comprehensiveness Score. Through evaluation on the CUB-200-2011, Stanford Cars and Stanford Dogs datasets, we show that our framework compares favourably against existing ProtoPNets while achieving better interpretability. Code is available at: https://github.com/zijizhu/proto-non-param.
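To make the pipeline concrete, the following is a minimal sketch of the two ingredients the abstract describes: forming class-level part prototypes by clustering patch features from a frozen vision backbone, and scoring an image by matching its patches against those prototypes. This is not the authors' implementation; the feature dimension, the number of parts `k`, the use of plain k-means (the paper's clustering is non-parametric), and the cosine-similarity matching rule are all illustrative assumptions, with random arrays standing in for backbone features.

```python
import numpy as np

def kmeans(features: np.ndarray, k: int, iters: int = 50, seed: int = 0) -> np.ndarray:
    """Cluster patch features of shape (N, D) into k centroids of shape (k, D).

    Stand-in for the paper's non-parametric clustering step.
    """
    rng = np.random.default_rng(seed)
    centroids = features[rng.choice(len(features), size=k, replace=False)]
    for _ in range(iters):
        # Assign each patch feature to its nearest centroid.
        dists = np.linalg.norm(features[:, None] - centroids[None], axis=-1)
        labels = dists.argmin(axis=1)
        # Move each centroid to the mean of its assigned features.
        for j in range(k):
            members = features[labels == j]
            if len(members):
                centroids[j] = members.mean(axis=0)
    return centroids

def match_scores(image_patches: np.ndarray, prototypes: np.ndarray) -> np.ndarray:
    """Max cosine similarity between any image patch and each part prototype."""
    a = image_patches / np.linalg.norm(image_patches, axis=1, keepdims=True)
    b = prototypes / np.linalg.norm(prototypes, axis=1, keepdims=True)
    return (a @ b.T).max(axis=0)  # one score per prototype

# Toy usage: random "patch features" in place of real backbone outputs.
rng = np.random.default_rng(1)
class_patch_feats = rng.normal(size=(200, 64))  # patches pooled over one class
prototypes = kmeans(class_patch_feats, k=5)     # 5 part prototypes per class
scores = match_scores(rng.normal(size=(49, 64)), prototypes)
print(scores.shape)  # (5,)
```

In a full model, such per-prototype scores would feed a classification head, and the prototypes would serve as the visual evidence shown in explanations.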