QIN, Yuzhen; LI, Yingcong; PASQUALETTI, Fabio; FAZEL, Maryam; OYMAK, Samet. Stochastic Contextual Bandits with Long Horizon Rewards. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 37, n. 8, p. 9525–9533, 2023. DOI: 10.1609/aaai.v37i8.26140. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/26140. Acesso em: 25 may. 2026.