🤖 AI Summary
Large language models (LLMs) struggle to accurately comprehend and generate qualitative spatial relations—such as “turn left,” “adjacent to,” or “cross”—in pedestrian wayfinding tasks. To address this, we propose a novel framework integrating qualitative spatial reasoning with graph-structured retrieval-augmented generation (RAG). Our approach innovatively embeds the RCC-8 and OPRA qualitative spatial calculi into street-network topological graphs, enabling semantic-aligned bidirectional spatial reasoning. Coupled with graph neural networks, it supports structured modeling of spatial relations and context-aware generation. Evaluated on real-world urban pedestrian navigation benchmarks, our method achieves a 27% improvement in instruction accuracy and reduces spatial relation errors by 41% over state-of-the-art LLM baselines. This work establishes a new paradigm for interpretable and robust route guidance in embodied navigation, bridging symbolic spatial reasoning with neural language generation.
📝 Abstract
This paper deals with improving the capabilities of Large Language Models (LLM) to provide route instructions for pedestrian wayfinders by means of qualitative spatial relations.