On Picking Good Policies: Leveraging Action-Policy Testing in Policy Training
DOI:
https://doi.org/10.1609/icaps.v35i1.36116Abstract
Testing is a natural approach to assess the quality of learned action policies π. Prior work introduced policy testing in AI planning as searching for bugs in π, that is, states where π is sub-optimal with respect to a given testing objective. Beyond quality assurance, an obvious application of these methods is policy selection: given several π to choose from, we can use testing to select the "least buggy" one. Here, we integrate testing-based policy selection into the training process. This includes making more informed decisions when selecting the final policy after training, as well as choosing more promising intermediate policies during the training process. Our experiments with ASNets action policies show that integrating testing allows us to more reliably obtain good-quality policies.Downloads
Published
2025-09-16
How to Cite
Eisenhut, J., Fišer, D., Valera, I., & Hoffmann, J. (2025). On Picking Good Policies: Leveraging Action-Policy Testing in Policy Training. Proceedings of the International Conference on Automated Planning and Scheduling, 35(1), 183–188. https://doi.org/10.1609/icaps.v35i1.36116
Issue
Section
Algorithmic papers