Instance-Sensitive Algorithms for Pure Exploration in Multinomial Logit Bandit

Authors

  • Nikolai Karpov Indiana University Bloomington
  • Qin Zhang Indiana University Bloomington

DOI:

https://doi.org/10.1609/aaai.v36i7.20669

Keywords:

Machine Learning (ML)

Abstract

Motivated by real-world applications such as fast fashion retailing and online advertising, the Multinomial Logit Bandit (MNL-bandit) is a popular model in online learning and operations research, and has attracted much attention in the past decade. In this paper, we give efficient algorithms for pure exploration in MNL-bandit. Our algorithms achieve instance-sensitive pull complexities. We also complement the upper bounds by an almost matching lower bound.

Downloads

Published

2022-06-28

How to Cite

Karpov, N., & Zhang, Q. (2022). Instance-Sensitive Algorithms for Pure Exploration in Multinomial Logit Bandit. Proceedings of the AAAI Conference on Artificial Intelligence, 36(7), 7096-7103. https://doi.org/10.1609/aaai.v36i7.20669

Issue

Section

AAAI Technical Track on Machine Learning II