Quantum Lipschitz Bandits

Authors

  • Bongsoo Yi University of North Carolina at Chapel Hill
  • Yue Kang Microsoft
  • Yao Li University of North Carolina at Chapel Hill

DOI:

https://doi.org/10.1609/aaai.v40i33.40007

Abstract

The Lipschitz bandit is a key variant of stochastic bandit problems where the expected reward function satisfies a Lipschitz condition with respect to an arm metric space. With its wide-ranging practical applications, various Lipschitz bandit algorithms have been developed, achieving the optimal regret performance in the classical setting. Motivated by recent advancements in quantum computing and the demonstrated success of quantum Monte Carlo in simpler bandit settings, we introduce the first quantum Lipschitz bandit algorithms to address the challenges of continuous action spaces and non-linear reward functions. Specifically, we first leverage the elimination-based framework to propose an efficient quantum Lipschitz bandit algorithm named Q-LAE. Next, we present novel modifications to the classical Zooming algorithm, which results in a simple quantum Lipschitz bandit method, Q-Zooming. Both algorithms exploit the computational power of quantum methods to obtain a provably improved regret bound over classical Lipschitz bandit algorithms. Comprehensive experiments further validate our improved theoretical findings, demonstrating superior empirical performance compared to existing Lipschitz bandit methods.

Downloads

Published

2026-03-14

How to Cite

Yi, B., Kang, Y., & Li, Y. (2026). Quantum Lipschitz Bandits. Proceedings of the AAAI Conference on Artificial Intelligence, 40(33), 27844-27851. https://doi.org/10.1609/aaai.v40i33.40007

Issue

Section

AAAI Technical Track on Machine Learning X