[1]

Y.-H. Hung, P.-C. Hsieh, X. Liu, and P. R. Kumar, “Reward-Biased Maximum Likelihood Estimation for Linear Stochastic Bandits”, AAAI, vol. 35, no. 9, pp. 7874-7882, May 2021.