XGraphRAG: Interactive Visual Analysis for Graph-based Retrieval-Augmented Generation

📅 2025-06-10
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
GraphRAG systems suffer from poor debuggability and interpretability due to heavy reliance on large language model (LLM) calls during graph construction and querying, coupled with opaque, black-box pipelines—hindering effectiveness evaluation and failure diagnosis. To address this, we propose the first end-to-end visual analytics paradigm for GraphRAG, covering the full pipeline: graph construction → subgraph retrieval → LLM-based generation. Our interactive web framework integrates synchronized multi-view visualization, query-path highlighting, subgraph provenance mapping, and fine-grained correlation of LLM invocation logs. This enables comprehensive traceability across all stages. Empirical evaluation demonstrates a 3.2× acceleration in fault attribution and a 41% improvement in key recall identification accuracy. The open-source system has been widely adopted by the research and practitioner community, supporting GraphRAG optimization in multiple real-world applications.

Technology Category

Application Category

📝 Abstract
Graph-based Retrieval-Augmented Generation (RAG) has shown great capability in enhancing Large Language Model (LLM)'s answer with an external knowledge base. Compared to traditional RAG, it introduces a graph as an intermediate representation to capture better structured relational knowledge in the corpus, elevating the precision and comprehensiveness of generation results. However, developers usually face challenges in analyzing the effectiveness of GraphRAG on their dataset due to GraphRAG's complex information processing pipeline and the overwhelming amount of LLM invocations involved during graph construction and query, which limits GraphRAG interpretability and accessibility. This research proposes a visual analysis framework that helps RAG developers identify critical recalls of GraphRAG and trace these recalls through the GraphRAG pipeline. Based on this framework, we develop XGraphRAG, a prototype system incorporating a set of interactive visualizations to facilitate users' analysis process, boosting failure cases collection and improvement opportunities identification. Our evaluation demonstrates the effectiveness and usability of our approach. Our work is open-sourced and available at https://github.com/Gk0Wk/XGraphRAG.
Problem

Research questions and friction points this paper is trying to address.

Analyzing GraphRAG effectiveness on datasets
Understanding complex GraphRAG information processing pipeline
Improving GraphRAG interpretability and accessibility
Innovation

Methods, ideas, or system contributions that make the work stand out.

Graph-based RAG enhances LLM with structured knowledge
Visual analysis framework improves GraphRAG interpretability
Interactive visualizations aid failure analysis and improvements
🔎 Similar Papers
No similar papers found.
K
Ke Wang
State Key Lab of CAD&CG, Zhejiang University
B
Bo Pan
State Key Lab of CAD&CG, Zhejiang University
Yingchaojie Feng
Yingchaojie Feng
Zhejiang University
Visual AnalyticsNatural Language ProcessingHuman Computer Interaction
Yuwei Wu
Yuwei Wu
Ph.D. candidate, GRASP Lab, University of Pennsylvania
RoboticsTrajectory OptimizationTask and Motion Planning
J
Jieyi Chen
State Key Lab of CAD&CG, Zhejiang University
Minfeng Zhu
Minfeng Zhu
Zhejiang University
VisualisationMath
W
Wei Chen
State Key Lab of CAD&CG, Zhejiang University