Beyond the Veil of Similarity: Quantifying Semantic Continuity in Explainable AI

📅 2024-07-17
🏛️ xAI
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing model explanations in explainable AI (XAI) suffer from semantic discontinuity and concept drift—i.e., inconsistent conceptual interpretations across semantically similar inputs—undermining interpretability and trust. Method: We propose the first formal framework for quantifying *semantic continuity*, defined as the robustness of explanation concepts to small input perturbations. Our approach introduces a differentiable metric grounded in concept-space mapping, integrating Concept Activation Vectors (CAVs), adversarial perturbation analysis, and semantic embedding alignment. We further design a continuity-aware gradient regularization strategy for training explanation models. Results: Evaluated on ImageNet and COCO-XAI benchmarks, our method improves explanation stability by 37.2%, significantly enhancing human interpretability and decision reliability. This work overcomes the limitation of conventional similarity-based evaluation metrics—which ignore semantic drift—and establishes a new paradigm for rigorous assessment and optimization of explanation continuity in XAI.

Technology Category

Application Category

Problem

Research questions and friction points this paper is trying to address.

Artificial Intelligence
Explanation Stability
Semantic Coherence
Innovation

Methods, ideas, or system contributions that make the work stand out.

Stability Assessment
Explainable AI
Semantic Coherence
🔎 Similar Papers
No similar papers found.
Q
Qi Huang
Leiden University, Leiden, The Netherlands
E
Emanuele Mezzi
Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
Osman Mutlu
Osman Mutlu
Wageningen University & Research
Explainable Artificial IntelligenceFederated LearningNatural Language Processing
Miltiadis Kofinas
Miltiadis Kofinas
University of Amsterdam, Amsterdam, The Netherlands
Vidya Prasad
Vidya Prasad
Eindhoven University of Technology, Eindhoven, The Netherlands
S
Shadnan Azwad Khan
Sorbonne University, Paris, France
E
Elena Ranguelova
Netherlands eScience center, Amsterdam, The Netherlands
N
N. V. Stein
Leiden University, Leiden, The Netherlands