Efficient Path Query Processing in Relational Database Systems

📅 2026-04-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Existing graph and relational database systems struggle to efficiently evaluate path queries that combine regular expressions with complex attribute constraints, primarily due to their inability to perform early pruning of intermediate results. This work proposes ReCAP, an abstraction mechanism that explicitly models nondeterministic finite automaton (NFA) state transitions and integrates a minimal set of user-defined functions to deeply push arbitrary attribute constraints down into the lowest layers of the query execution plan. By embedding this approach within optimization frameworks of relational databases such as DuckDB, ReCAP enables highly effective early filtering of intermediate results. Empirical evaluation across diverse graph datasets and query workloads demonstrates performance improvements of up to 400,000× over state-of-the-art systems.
📝 Abstract
Path queries are crucial for property graphs, and there is growing interest in queries that combine regular expressions over labels with constraints on property values of vertices and edges. Efficient evaluation of such general path queries requires that intermediate results be eliminated early when there is no possible completion to a full result path. Neither state-of-the-art (SOA) graph DBMS nor relational DBMS currently can do this effectively for a large class of queries. We show that this problem can be addressed by giving a relational optimizer ``a little help'' by specifying early filtering opportunities explicitly in the query. To this end, we propose ReCAP, an abstraction that greatly simplifies the implementation of early filtering techniques for any type of property constraint for which such early filtering can be derived. No matter how complex the constraint, one only needs to implement (1) an NFA-style state transition function and (2) a handful of functions that mirror those needed for user-defined aggregates. We show that when using ReCAP, a standard relational DBMS like DuckDB can effectively push property constraints deep into the query plan, beating the SOA graph and relational DBMS by a factor up to 400,000 over a variety of queries and input graphs.
Problem

Research questions and friction points this paper is trying to address.

path queries
property graphs
regular expressions
property constraints
early filtering
Innovation

Methods, ideas, or system contributions that make the work stand out.

ReCAP
early filtering
path query
property graph
relational DBMS
🔎 Similar Papers
No similar papers found.