Dukkipati, Ambedkar, Ranga Shaarad Ayyagari, Bodhisattwa Dasgupta, Parag Dutta, and Prabhas Reddy Onteru. 2025. “Active Reinforcement Learning Strategies for Offline Policy Improvement.” Proceedings of the AAAI Conference on Artificial Intelligence 39 (16): 16418–25. https://doi.org/10.1609/aaai.v39i16.33803.