Mandal, D., Radanovic, G., Gan, J., Singla, A. and Majumdar, R. (2023) “Online Reinforcement Learning with Uncertain Episode Lengths”, Proceedings of the AAAI Conference on Artificial Intelligence, 37(7), pp. 9064-9071. doi: 10.1609/aaai.v37i7.26088.