CHANG, J. Q. L.; TAN, V. Y. F. A Unifying Theory of Thompson Sampling for Continuous Risk-Averse Bandits. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 36, n. 6, p. 6159-6166, 2022. DOI: 10.1609/aaai.v36i6.20564. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/20564. Acesso em: 25 apr. 2026.