[1]
Agrawal, P., Chen, J. and Jiang, N. 2021. Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration. Proceedings of the AAAI Conference on Artificial Intelligence. 35, 8 (May 2021), 6566-6573. DOI:https://doi.org/10.1609/aaai.v35i8.16813.