Context Uncertainty in Contextual Bandits with Applications to Recommender Systems

Hao Wang; Yifei Ma; Hao Ding; Yuyang Wang

doi:10.1609/aaai.v36i8.20831

Authors

Hao Wang, AWS AI Labs; Rutgers University
Yifei Ma, AWS AI Labs
Hao Ding, AWS AI Labs
Yuyang Wang, AWS AI Labs

DOI:

https://doi.org/10.1609/aaai.v36i8.20831

Keywords:

Machine Learning (ML), Reasoning Under Uncertainty (RU), Data Mining & Knowledge Management (DMKM), Domain(s) Of Application (APP)

Abstract

Recurrent neural networks have proven effective in modeling sequential user feedback for recommender systems. However, they usually focus solely on item relevance and fail to explore diverse items for users effectively, which harms system performance in the long run. To address this problem, we propose a new type of recurrent neural network, dubbed the recurrent exploration network (REN), that jointly performs representation learning and effective exploration in the latent space. REN aims to balance relevance and exploration while taking into account the uncertainty in the learned representations. Our theoretical analysis shows that REN can preserve the rate-optimal sublinear regret even when the learned representations are uncertain. Our empirical study demonstrates that REN can achieve satisfactory long-term rewards on both synthetic and real-world recommendation datasets, outperforming state-of-the-art models.
