Hung, Y.-H. and Hsieh, P.-C. (2023) “Reward-Biased Maximum Likelihood Estimation for Neural Contextual Bandits: A Distributional Learning Perspective”, Proceedings of the AAAI Conference on Artificial Intelligence, 37(7), pp. 7944-7952. doi: 10.1609/aaai.v37i7.25961.