Chang, Joel Q. L., and Vincent Y. F. Tan. “A Unifying Theory of Thompson Sampling for Continuous Risk-Averse Bandits”. Proceedings of the AAAI Conference on Artificial Intelligence 36, no. 6 (June 28, 2022): 6159-6166. Accessed April 30, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/20564.