Agarwal, P., Wajid, M. S., & Kalyanakrishnan, S. (2025). An Improved Lower Bound on the Length of Locally-Improving Policy Sequences in MDPs with Large Action Sets. Proceedings of the International Conference on Automated Planning and Scheduling, 35(1), 2-10. https://doi.org/10.1609/icaps.v35i1.36095