On the Robustness of Bandit Multiple Testing

Authors

  • Zhengyu Zhou Wuhan University
  • Weiwei Liu Wuhan University

DOI:

https://doi.org/10.1609/aaai.v40i34.40148

Abstract

Bandit multiple hypothesis testing has broad applications in biological sciences, clinical testing for drug discovery, and online A/B/n testing. The framework utilizes an adaptive sampling strategy for multiple testing which aims to maximize statistical power while ensuring anytime false discovery rate control. This paper proposes a robust approach for bandit multiple testing, allowing for at most an epsilon fraction of arbitrary distribution corruption, as in Huber’s contamination model. Specifically, we introduce two adaptive sampling strategies designed to minimize the number of samples required to exceed a target true positive rate, while providing anytime control over the false discovery rate. We analyze the sample complexity of our proposed methods and perform numerical simulations to demonstrate their efficiency and robustness. Furthermore, we extend our methods to address scenarios where distributions have infinite variance and situations involving multiple agents collaborating on the same bandit task.

Downloads

Published

2026-03-14

How to Cite

Zhou, Z., & Liu, W. (2026). On the Robustness of Bandit Multiple Testing. Proceedings of the AAAI Conference on Artificial Intelligence, 40(34), 29107–29114. https://doi.org/10.1609/aaai.v40i34.40148

Issue

Section

AAAI Technical Track on Machine Learning XI