1.
Hung Y-H, Hsieh P-C, Liu X, Kumar PR. Reward-Biased Maximum Likelihood Estimation for Linear Stochastic Bandits. AAAI [Internet]. 2021May18 [cited 2024Mar.28];35(9):7874-82. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/16961