Agrawal, P., Chen, J., & Jiang, N. (2021). Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration. Proceedings of the AAAI Conference on Artificial Intelligence, 35(8), 6566–6573. https://doi.org/10.1609/aaai.v35i8.16813