Constrained and Robust Policy Synthesis with Satisfiability-Modulo-Probabilistic-Model-Checking

Authors

  • Linus Heck Radboud University
  • Filip Macák Brno University of Technology
  • Milan Češka Brno University of Technology
  • Sebastian Junges Radboud University

DOI:

https://doi.org/10.1609/aaai.v40i43.40944

Abstract

The ability to compute reward-optimal policies for given and known finite Markov decision processes (MDPs) underpins a variety of applications across planning, controller synthesis, and verification. However, we often want policies (1) to be robust, i.e., they perform well on perturbations of the MDP and (2) to satisfy additional structural constraints regarding, e.g., their representation or implementation cost. Computing such robust and constrained policies is indeed computationally more challenging. This paper contributes the first approach to effectively compute robust policies subject to arbitrary structural constraints using a flexible and efficient framework. We achieve flexibility by allowing to express our constraints in a first-order theory over a set of MDPs, while the root for our efficiency lies in the tight integration of satisfiability solvers to handle the combinatorial nature of the problem and probabilistic model checking algorithms to handle the analysis of MDPs. Experiments on a few hundred benchmarks demonstrate the feasibility for constrained and robust policy synthesis and the competitiveness with state-of-the-art methods for various fragments of the problem.

Published

2026-03-14

How to Cite

Heck, L., Macák, F., Češka, M., & Junges, S. (2026). Constrained and Robust Policy Synthesis with Satisfiability-Modulo-Probabilistic-Model-Checking. Proceedings of the AAAI Conference on Artificial Intelligence, 40(43), 36253–36261. https://doi.org/10.1609/aaai.v40i43.40944

Issue

Section

AAAI Technical Track on Planning, Routing, and Scheduling