[1]
Mandal, D., Radanovic, G., Gan, J., Singla, A. and Majumdar, R. 2023. Online Reinforcement Learning with Uncertain Episode Lengths. Proceedings of the AAAI Conference on Artificial Intelligence. 37, 7 (Jun. 2023), 9064-9071. DOI:https://doi.org/10.1609/aaai.v37i7.26088.