🤖 AI Summary
In high-stakes domains such as healthcare, balancing model interpretability and accuracy remains a critical challenge for convolutional neural networks. To address this, we propose Batch-CAM: an end-to-end trainable framework that, for the first time, integrates batch-level Grad-CAM attention into the training pipeline and couples it with a prototype reconstruction loss. By explicitly guiding the model to attend to discriminative image regions, Batch-CAM jointly optimizes classification performance and explanation fidelity. Evaluated on multiple medical imaging benchmarks, it achieves consistent improvements—average classification accuracy increases by +1.3%, heatmap–ground-truth lesion overlap (IoU) improves by 12.7%, and prototype-driven image reconstruction quality is enhanced. Moreover, it reduces training time by 19% and inference latency by 23%. Our core contribution lies in achieving synergistic optimization across accuracy, interpretability, and computational efficiency—establishing a novel paradigm for trustworthy AI in safety-critical applications.
📝 Abstract
Understanding the inner workings of deep learning models is crucial for advancing artificial intelligence, particularly in high-stakes fields such as healthcare, where accurate explanations are as vital as precision. This paper introduces Batch-CAM, a novel training paradigm that fuses a batch implementation of the Grad-CAM algorithm with a prototypical reconstruction loss. This combination guides the model to focus on salient image features, thereby enhancing its performance across classification tasks. Our results demonstrate that Batch-CAM achieves a simultaneous improvement in accuracy and image reconstruction quality while reducing training and inference times. By ensuring models learn from evidence-relevant information,this approach makes a relevant contribution to building more transparent, explainable, and trustworthy AI systems.