Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives

Authors

  • Marius Belly CNRS, LaBRI, Université de Bordeaux, France
  • Nathanaël Fijalkow CNRS, LaBRI, Université de Bordeaux, France
  • Hugo Gimbert CNRS, LaBRI, Université de Bordeaux, France
  • Florian Horn CNRS, IRIF, Université de Paris, France
  • Guillermo A. Pérez University of Antwerp – Flanders Make, Antwerp, Belgium
  • Pierre Vandenhove CNRS, LaBRI, Université de Bordeaux, France

DOI:

https://doi.org/10.1609/aaai.v39i25.34845

Abstract

Partially observable Markov decision processes (POMDPs) form a prominent model for uncertainty in sequential decision making. We are interested in constructing algorithms with theoretical guarantees to determine whether the agent has a strategy ensuring a given specification with probability 1. This well-studied problem is known to be undecidable already for very simple omega-regular objectives, because of the difficulty of reasoning on uncertain events. We introduce a revelation mechanism which restricts information loss by requiring that almost surely the agent has eventually full information of the current state. Our main technical results are to construct exact algorithms for two classes of POMDPs called weakly and strongly revealing. Importantly, the decidable cases reduce to the analysis of a finite belief-support Markov decision process. This yields a conceptually simple and exact algorithm for a large class of POMDPs.

Published

2025-04-11

How to Cite

Belly, M., Fijalkow, N., Gimbert, H., Horn, F., Pérez, G. A., & Vandenhove, P. (2025). Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives. Proceedings of the AAAI Conference on Artificial Intelligence, 39(25), 26454–26462. https://doi.org/10.1609/aaai.v39i25.34845

Issue

Section

AAAI Technical Track on Planning, Routing, and Scheduling