Context Uncertainty in Contextual Bandits with Applications to Recommender Systems

Authors

  • Hao Wang AWS AI Labs Rutgers University
  • Yifei Ma AWS AI Labs
  • Hao Ding AWS AI Labs
  • Yuyang Wang AWS AI Labs

DOI:

https://doi.org/10.1609/aaai.v36i8.20831

Keywords:

Machine Learning (ML), Reasoning Under Uncertainty (RU), Data Mining & Knowledge Management (DMKM), Domain(s) Of Application (APP)

Abstract

Recurrent neural networks have proven effective in modeling sequential user feedbacks for recommender systems. However, they usually focus solely on item relevance and fail to effectively explore diverse items for users, therefore harming the system performance in the long run. To address this problem, we propose a new type of recurrent neural networks, dubbed recurrent exploration networks (REN), to jointly perform representation learning and effective exploration in the latent space. REN tries to balance relevance and exploration while taking into account the uncertainty in the representations. Our theoretical analysis shows that REN can preserve the rate-optimal sublinear regret even when there exists uncertainty in the learned representations. Our empirical study demonstrates that REN can achieve satisfactory long-term rewards on both synthetic and real-world recommendation datasets, outperforming state-of-the-art models.

Downloads

Published

2022-06-28

How to Cite

Wang, H., Ma, Y., Ding, H., & Wang, Y. (2022). Context Uncertainty in Contextual Bandits with Applications to Recommender Systems. Proceedings of the AAAI Conference on Artificial Intelligence, 36(8), 8539-8547. https://doi.org/10.1609/aaai.v36i8.20831

Issue

Section

AAAI Technical Track on Machine Learning III