Adversarial Linear Contextual Bandits with Graph-Structured Side Observations

Authors

  • Lingda Wang University of Illinois at Urbana-Champaign
  • Bingcong Li University of Minnesota - Twin Cities
  • Huozhi Zhou University of Illinois at Urbana-Champaign
  • Georgios B. Giannakis University of Minnesota - Twin Cities
  • Lav R. Varshney University of Illinois at Urbana-Champaign
  • Zhizhen Zhao University of Illinois at Urbana-Champaign

DOI:

https://doi.org/10.1609/aaai.v35i11.17218

Keywords:

Online Learning & Bandits

Abstract

This paper studies the adversarial graphical contextual bandits, a variant of adversarial multi-armed bandits that leverage two categories of the most common side information: contexts and side observations. In this setting, a learning agent repeatedly chooses from a set of K actions after being presented with a d-dimensional context vector. The agent not only incurs and observes the loss of the chosen action, but also observes the losses of its neighboring actions in the observation structures, which are encoded as a series of feedback graphs. This setting models a variety of applications in social networks, where both contexts and graph-structured side observations are available. Two efficient algorithms are developed based on EXP3. Under mild conditions, our analysis shows that for undirected feedback graphs the first algorithm, EXP3-LGC-U, achieves a sub-linear regret with respect to the time horizon and the average independence number of the feedback graphs. A slightly weaker result is presented for the directed graph setting as well. The second algorithm, EXP3-LGC-IX, is developed for a special class of problems, for which the regret is the same for both directed as well as undirected feedback graphs. Numerical tests corroborate the efficiency of proposed algorithms.

Downloads

Published

2021-05-18

How to Cite

Wang, L., Li, B., Zhou, H., Giannakis, G. B., Varshney, L. R., & Zhao, Z. (2021). Adversarial Linear Contextual Bandits with Graph-Structured Side Observations. Proceedings of the AAAI Conference on Artificial Intelligence, 35(11), 10156-10164. https://doi.org/10.1609/aaai.v35i11.17218

Issue

Section

AAAI Technical Track on Machine Learning IV