[1]

Hung, Y.-H. and Hsieh, P.-C. 2023. Reward-Biased Maximum Likelihood Estimation for Neural Contextual Bandits: A Distributional Learning Perspective. Proceedings of the AAAI Conference on Artificial Intelligence. 37, 7 (Jun. 2023), 7944-7952. DOI:https://doi.org/10.1609/aaai.v37i7.25961.