DisasterLex: An Expert Concept-to-Schema Knowledge Graph for Geospatial Reasoning in Disaster Analytics

📅 2026-05-28
📈 Citations: 0
Influential: 0
📄 PDF

career value

172K/year
🤖 AI Summary
This study addresses the limitations of existing text-to-SQL methods in disaster analysis, where heterogeneous geospatial data patterns and causal reasoning requirements pose significant challenges. To overcome these issues, the authors propose a novel framework that integrates an expert knowledge graph (EKG) with causal inference. The approach leverages a concept-to-table schema mapping and a four-stage query orchestration mechanism to enable precise access to cross-table geospatial data. Notably, it is the first to combine EKGs with causal edge reasoning, progressively constraining schema traversal across stages to substantially improve domain routing and SQL generation accuracy. Evaluated on 75 test queries, the method outperforms four state-of-the-art baselines by 1.4–2.75×, achieving absolute scores of 1.65–3.56 on a 5.0 scale.
📝 Abstract
Disasters are inevitable and increasingly costly, and effective response depends on querying structured tabular data: precise, information-dense records of hazard, exposure, vulnerability, and lifeline infrastructure that underpin disaster management. Current text-to-SQL methods enable natural-language access to such tables but transfer poorly to the disaster domain, where queries span heterogeneous geospatial schemas and require reasoning over causal relations. We introduce DisasterLex, a knowledge-graph-mediated framework that inserts an Expert Knowledge Graph (EKG) of curated concepts and typed causal edges between the user query and the database, bridged to schema by concept-to-table links. The orchestration runs four stages (identifying query entities, routing to the operational domain, planning over causal edges, and grounding the SQL), restricting the schema passed to the model at each step. We instantiate it on a disaster-analytics database (36 geospatial tables, 150 columns) with an EKG of 107 concepts, 117 causal edges, and 52 concept-to-schema links, evaluated on a 75-query test set. On all seven base models spanning proprietary and open-weight families, DisasterLex beats four state-of-the-art baselines (LightRAG, HippoRAG 2, ReFoRCE, CHESS) by 1.4x to 2.75x, with absolute scores of 1.65 to 3.56 (of 5.0). Error analysis shows baseline failures cluster in routing and multi-table SQL composition, the operations our orchestration explicitly addresses. Code, data, and the EKG artifact are available at https://github.com/YimingXiao98/DisasterLex and on Zenodo at https://doi.org/10.5281/zenodo.20388029.
Problem

Research questions and friction points this paper is trying to address.

text-to-SQL
disaster analytics
geospatial reasoning
causal relations
heterogeneous schemas
Innovation

Methods, ideas, or system contributions that make the work stand out.

Expert Knowledge Graph
geospatial reasoning
text-to-SQL
disaster analytics
causal relations
🔎 Similar Papers
No similar papers found.