New Classes of the Greedy-Applicable Arm Feature Distributions in the Sparse Linear Bandit Problem

Authors

  • Koji Ichikawa NEC Corporation National Institute of Advanced Industrial Science and Technology
  • Shinji Ito NEC Corporation National Institute of Advanced Industrial Science and Technology RIKEN AIP
  • Daisuke Hatano RIKEN AIP
  • Hanna Sumita Tokyo Institute of Technology
  • Takuro Fukunaga Chuo University
  • Naonori Kakimura Keio University
  • Ken-ichi Kawarabayashi National Institute of Informatics The University of Tokyo

DOI:

https://doi.org/10.1609/aaai.v38i11.29166

Keywords:

ML: Online Learning & Bandits, ML: Learning Theory

Abstract

We consider the sparse contextual bandit problem where arm feature affects reward through the inner product of sparse parameters. Recent studies have developed sparsity-agnostic algorithms based on the greedy arm selection policy. However, the analysis of these algorithms requires strong assumptions on the arm feature distribution to ensure that the greedily selected samples are sufficiently diverse; One of the most common assumptions, relaxed symmetry, imposes approximate origin-symmetry on the distribution, which cannot allow distributions that has origin-asymmetric support. In this paper, we show that the greedy algorithm is applicable to a wider range of the arm feature distributions from two aspects. Firstly, we show that a mixture distribution that has a greedy-applicable component is also greedy-applicable. Second, we propose new distribution classes, related to Gaussian mixture, discrete, and radial distribution, for which the sample diversity is guaranteed. The proposed classes can describe distributions with origin-asymmetric support and, in conjunction with the first claim, provide theoretical guarantees of the greedy policy for a very wide range of the arm feature distributions.

Published

2024-03-24

How to Cite

Ichikawa, K., Ito, S., Hatano, D., Sumita, H., Fukunaga, T., Kakimura, N., & Kawarabayashi, K.- ichi. (2024). New Classes of the Greedy-Applicable Arm Feature Distributions in the Sparse Linear Bandit Problem. Proceedings of the AAAI Conference on Artificial Intelligence, 38(11), 12708–12716. https://doi.org/10.1609/aaai.v38i11.29166

Issue

Section

AAAI Technical Track on Machine Learning II