Formally Verified Approximate Policy Iteration

Maximilian Schäffeler; Mohammad Abdulaziz

doi:10.1609/aaai.v39i25.34868

Formally Verified Approximate Policy Iteration

Authors

Maximilian Schäffeler Technische Universität München
Mohammad Abdulaziz King's College London

DOI:

https://doi.org/10.1609/aaai.v39i25.34868

Abstract

We present a methodology based on interactive theorem proving that facilitates the development of verified implementations of algorithms for solving factored Markov Decision Processes. As a case study, we formally verify an algorithm for approximate policy iteration in the proof assistant Isabelle/HOL. We show how the verified algorithm can be refined to an executable, verified implementation. Our evaluation on benchmark problems shows that it is practical. As part of the development, we build verified software to certify linear programming solutions. We discuss the verification process and the modifications we made to the algorithm during formalization.

AAAI-25 / IAAI-25 / EAAI-25 Proceedings Cover

Downloads

Published

2025-04-11

How to Cite

Schäffeler, M., & Abdulaziz, M. (2025). Formally Verified Approximate Policy Iteration. Proceedings of the AAAI Conference on Artificial Intelligence, 39(25), 26659–26667. https://doi.org/10.1609/aaai.v39i25.34868

Download Citation

Issue

Vol. 39 No. 25: AAAI-25 Technical Tracks 25

Section

AAAI Technical Track on Planning, Routing, and Scheduling

Formally Verified Approximate Policy Iteration

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information