AGRAWAL, P.; CHEN, J.; JIANG, N. Improved Worst-Case Regret Bounds for Randomized Least-Squares Value Iteration. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 35, n. 8, p. 6566-6573, 2021. Available at: https://ojs.aaai.org/index.php/AAAI/article/view/16813. Accessed: 18 Jan. 2022.