Formally Verified Approximate Policy Iteration

Authors

  • Maximilian Schäffeler Technische Universität München
  • Mohammad Abdulaziz King's College London

DOI:

https://doi.org/10.1609/aaai.v39i25.34868

Abstract

We present a methodology based on interactive theorem proving that facilitates the development of verified implementations of algorithms for solving factored Markov Decision Processes. As a case study, we formally verify an algorithm for approximate policy iteration in the proof assistant Isabelle/HOL. We show how the verified algorithm can be refined to an executable, verified implementation. Our evaluation on benchmark problems shows that it is practical. As part of the development, we build verified software to certify linear programming solutions. We discuss the verification process and the modifications we made to the algorithm during formalization.

Downloads

Published

2025-04-11

How to Cite

Schäffeler, M., & Abdulaziz, M. (2025). Formally Verified Approximate Policy Iteration. Proceedings of the AAAI Conference on Artificial Intelligence, 39(25), 26659–26667. https://doi.org/10.1609/aaai.v39i25.34868

Issue

Section

AAAI Technical Track on Planning, Routing, and Scheduling