Tran-Thanh, Long, Archie Chapman, Enrique Munoz de Cote, Alex Rogers, and Nicholas R. Jennings. “Epsilon–First Policies for Budget–Limited Multi-Armed Bandits”. Proceedings of the AAAI Conference on Artificial Intelligence 24, no. 1 (July 4, 2010): 1211–1216. Accessed May 14, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/7758.