AI Summary
This work addresses offline planning for deterministic partially observable Markov decision processes (DetPOMDPs), a setting where the agent's state is uncertain, yet both transitions and observations are deterministic. We propose the first framework adapting Monte Carlo Value Iteration (MCVI) to DetPOMDPs, explicitly leveraging determinism in dynamics and observations to construct compact, interpretable finite-state controllers (FSCs). Our method significantly improves solution success rates and robustness on large-scale DetPOMDPs. Empirically, it outperforms existing approaches on standard benchmarks and a real-world mobile-robot mapping task in forest environments, achieving both high success probability and strong generalization across unseen scenarios.
Abstract
Deterministic partially observable Markov decision processes (DetPOMDPs) often arise in planning problems where the agent is uncertain about its environmental state but can act and observe deterministically. In this paper, we propose DetMCVI, an adaptation of the Monte Carlo Value Iteration (MCVI) algorithm for DetPOMDPs, which builds policies in the form of finite-state controllers (FSCs). DetMCVI solves large problems with a high success rate, outperforming existing baselines for DetPOMDPs. We also verify the performance of the algorithm in a real-world mobile robot forest mapping scenario.
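The finite-state controllers mentioned above can be pictured as a small graph: each node prescribes an action, and edges labeled by observations select the next node. The sketch below is illustrative only; the class names, the `step` interface, and the toy controller are assumptions, not the paper's implementation.

```python
# Minimal sketch of a finite-state controller (FSC), the policy
# representation DetMCVI builds. Illustrative only; not the paper's code.

class FSCNode:
    def __init__(self, action, edges=None):
        self.action = action      # action prescribed by this node
        self.edges = edges or {}  # observation -> index of next node

class FSC:
    def __init__(self, nodes, start=0):
        self.nodes = nodes
        self.start = start

    def run(self, step, max_steps=100):
        """Execute the controller against an environment callback.

        `step(action)` is assumed to return `(observation, done)`;
        in a DetPOMDP both the transition and the observation are
        deterministic functions of the hidden state and action.
        """
        i = self.start
        trace = []
        for _ in range(max_steps):
            node = self.nodes[i]
            obs, done = step(node.action)
            trace.append((node.action, obs))
            if done or obs not in node.edges:
                break
            i = node.edges[obs]
        return trace
```

For example, a two-node controller might keep moving while it observes `"clear"` and switch to a turning node when it observes `"wall"`; because dynamics are deterministic, the same controller reproduces the same trajectory from the same hidden start state.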