Context Uncertainty in Contextual Bandits with Applications to Recommender Systems
DOI:
https://doi.org/10.1609/aaai.v36i8.20831Keywords:
Machine Learning (ML), Reasoning Under Uncertainty (RU), Data Mining & Knowledge Management (DMKM), Domain(s) Of Application (APP)Abstract
Recurrent neural networks have proven effective in modeling sequential user feedbacks for recommender systems. However, they usually focus solely on item relevance and fail to effectively explore diverse items for users, therefore harming the system performance in the long run. To address this problem, we propose a new type of recurrent neural networks, dubbed recurrent exploration networks (REN), to jointly perform representation learning and effective exploration in the latent space. REN tries to balance relevance and exploration while taking into account the uncertainty in the representations. Our theoretical analysis shows that REN can preserve the rate-optimal sublinear regret even when there exists uncertainty in the learned representations. Our empirical study demonstrates that REN can achieve satisfactory long-term rewards on both synthetic and real-world recommendation datasets, outperforming state-of-the-art models.Downloads
Published
2022-06-28
How to Cite
Wang, H., Ma, Y., Ding, H., & Wang, Y. (2022). Context Uncertainty in Contextual Bandits with Applications to Recommender Systems. Proceedings of the AAAI Conference on Artificial Intelligence, 36(8), 8539-8547. https://doi.org/10.1609/aaai.v36i8.20831
Issue
Section
AAAI Technical Track on Machine Learning III