Explainable Graph Spectral Clustering For Text Embeddings

📅 2025-08-12
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the limited interpretability of text-embedding-driven graph clustering. We propose a general interpretability framework tailored to multiple static word embedding types—particularly GloVe. Methodologically, we model document similarity as cosine similarity in the word vector space, construct a semantic graph, perform graph-based clustering, and integrate visualization with feature attribution techniques to yield semantic-level explanations of clustering outcomes. Our key contribution is the first systematic extension of graph clustering interpretability to non-contextual, static embeddings—thereby overcoming prior reliance on contextualized models like BERT. Experimental results demonstrate that the framework consistently enhances clustering transparency and generalizability across diverse semantic spaces, significantly improving the understandability and trustworthiness of model decisions.

Technology Category

Application Category

📝 Abstract
In a previous paper, we proposed an introduction to the explainability of Graph Spectral Clustering results for textual documents, given that document similarity is computed as cosine similarity in term vector space. In this paper, we generalize this idea by considering other embeddings of documents, in particular, based on the GloVe embedding idea.
Problem

Research questions and friction points this paper is trying to address.

Extends explainable graph clustering to diverse text embeddings
Generalizes spectral clustering explainability beyond cosine similarity
Applies explainable methods to GloVe-based document embeddings
Innovation

Methods, ideas, or system contributions that make the work stand out.

Extends spectral clustering to GloVe embeddings
Generalizes explainability across document embedding types
Uses cosine similarity in vector space analysis
🔎 Similar Papers
No similar papers found.
M
Mieczysław A. Kłopotek
Institute of Computer Science of Polish Academy of Sciences, ul. Jana Kazimierza 5, 01-248 Warszawa, Poland
S
Sławomir T. Wierzchoń
Institute of Computer Science of Polish Academy of Sciences, ul. Jana Kazimierza 5, 01-248 Warszawa, Poland
B
Bartłomiej Starosta
Institute of Computer Science of Polish Academy of Sciences, ul. Jana Kazimierza 5, 01-248 Warszawa, Poland
Piotr Borkowski
Piotr Borkowski
Polish Academy of Sciences
artificial intelligencestatistics
Dariusz Czerski
Dariusz Czerski
Instytut Podstaw Informatyki Polskiej Akademii Nauk
sztuczna inteligencja
E
Eryk Laskowski
Institute of Computer Science of Polish Academy of Sciences, ul. Jana Kazimierza 5, 01-248 Warszawa, Poland