DepViT-CAD: Deployable Vision Transformer-Based Cancer Diagnosis in Histopathology

📅 2025-07-14
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the clinical need for precise, real-time multi-class cancer diagnosis in histopathological whole-slide images (WSIs), we propose a novel Multi-Attention Vision Transformer (MAViT). MAViT jointly models local morphological details and global structural context via complementary attention mechanisms, thereby enhancing discriminative capability for subtle inter-tumor morphological variations across 11 cancer types. Trained on 1,008 expert-annotated WSIs, the model achieves 94.11% sensitivity on the TCGA cohort and 92.0% on independent real-world clinical routine cases—surpassing existing ViT-based baselines. Designed for efficiency and robust generalization, MAViT features a lightweight architecture with reduced computational overhead while maintaining high accuracy and resilience to domain shifts. It has been successfully integrated into a deployable AI-assisted diagnostic system, delivering clinically actionable, high-fidelity, and robust decision support for pathologists.

Technology Category

Application Category

📝 Abstract
Accurate and timely cancer diagnosis from histopathological slides is vital for effective clinical decision-making. This paper introduces DepViT-CAD, a deployable AI system for multi-class cancer diagnosis in histopathology. At its core is MAViT, a novel Multi-Attention Vision Transformer designed to capture fine-grained morphological patterns across diverse tumor types. MAViT was trained on expert-annotated patches from 1008 whole-slide images, covering 11 diagnostic categories, including 10 major cancers and non-tumor tissue. DepViT-CAD was validated on two independent cohorts: 275 WSIs from The Cancer Genome Atlas and 50 routine clinical cases from pathology labs, achieving diagnostic sensitivities of 94.11% and 92%, respectively. By combining state-of-the-art transformer architecture with large-scale real-world validation, DepViT-CAD offers a robust and scalable approach for AI-assisted cancer diagnostics. To support transparency and reproducibility, software and code will be made publicly available at GitHub.
Problem

Research questions and friction points this paper is trying to address.

Develops AI system for multi-class cancer diagnosis
Captures fine-grained patterns in histopathology slides
Validates accuracy on real-world clinical datasets
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-Attention Vision Transformer for cancer diagnosis
Trained on 1008 expert-annotated whole-slide images
Validated on 325 clinical cases with high accuracy
🔎 Similar Papers
No similar papers found.
Ashkan Shakarami
Ashkan Shakarami
Computer Scientist & Postdoc Fellow in AI, Universität Bonn, Germany
AI/Computer VisionNeural NetworksComputer-Aided DiagnosisHistopathologyOphthalmology
L
Lorenzo Nicole
Unit of Surgical Pathology and Cytopathology, Ospedale dell’Angelo, Italy
Rocco Cappellesso
Rocco Cappellesso
University Hospital of Padova
Pathology
A
Angelo Paolo Dei Tos
Department of Medicine, University of Padova, Italy; Department of Integrated Diagnostics, Azienda Ospedale-Universit`a Padova, Italy
Stefano Ghidoni
Stefano Ghidoni
Full Professor, University of Padova, Italy
Computer visionRoboticsArtificial IntelligencePattern Recognition