🤖 AI Summary
In Multi-Agent Epistemic Planning (MEP), representing belief states as Kripke structures impedes the design of effective traditional heuristics, resulting in poorly guided search and severely limited scalability.
Method: This paper introduces Graph Neural Networks (GNNs) to MEP for the first time, directly encoding Kripke models to learn relational patterns from structured belief states. A supervised learning framework is employed to predict state quality and generate data-driven, generalizable heuristic functions.
Contribution/Results: The approach overcomes the bottleneck of hand-crafted heuristic design, enabling end-to-end, learned heuristic guidance for search. Experiments on multiple benchmark domains demonstrate substantial improvements in solving efficiency and problem scale handled, validating both effectiveness and scalability of the method in complex multi-agent epistemic planning.
📝 Abstract
Multi-agent Epistemic Planning (MEP) is an autonomous planning framework for reasoning about both the physical world and the beliefs of agents, with applications in domains where information flow and awareness among agents are critical. The richness of MEP requires states to be represented as Kripke structures, i.e., directed labeled graphs. This representation limits the applicability of existing heuristics, hindering the scalability of epistemic solvers, which must explore an exponential search space without guidance, resulting often in intractability. To address this, we exploit Graph Neural Networks (GNNs) to learn patterns and relational structures within epistemic states, to guide the planning process. GNNs, which naturally capture the graph-like nature of Kripke models, allow us to derive meaningful estimates of state quality -- e.g., the distance from the nearest goal -- by generalizing knowledge obtained from previously solved planning instances. We integrate these predictive heuristics into an epistemic planning pipeline and evaluate them against standard baselines, showing significant improvements in the scalability of multi-agent epistemic planning.