Kar, A., and R. Singh. “Policy Zooming: Adaptive Discretization-Based Infinite-Horizon Average-Reward Reinforcement Learning”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 27, Mar. 2026, pp. 22527-35, doi:10.1609/aaai.v40i27.39412.