1. Qin Y, Li Y, Pasqualetti F, Fazel M, Oymak S. Stochastic Contextual Bandits with Long Horizon Rewards. AAAI [Internet]. 2023 Jun 26 [cited 2024 Jul 15];37(8):9525-33. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/26140