Hung, Y.-H., and P.-C. Hsieh. “Reward-Biased Maximum Likelihood Estimation for Neural Contextual Bandits: A Distributional Learning Perspective”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 37, no. 7, June 2023, pp. 7944-52, doi:10.1609/aaai.v37i7.25961.