Hung, Yu-Heng, and Ping-Chun Hsieh. 2023. “Reward-Biased Maximum Likelihood Estimation for Neural Contextual Bandits: A Distributional Learning Perspective”. Proceedings of the AAAI Conference on Artificial Intelligence 37 (7):7944-52. https://doi.org/10.1609/aaai.v37i7.25961.