Federated X-armed Bandit

Authors

  • Wenjie Li Department of Statistics, Purdue University
  • Qifan Song Department of Statistics, Purdue University
  • Jean Honorio School of Computing and Information Systems, The University of Melbourne
  • Guang Lin Departments of Mathematics and School of Mechanical Engineering, Purdue University

DOI:

https://doi.org/10.1609/aaai.v38i12.29267

Keywords:

ML: Online Learning & Bandits, ML: Auto ML and Hyperparameter Tuning, ML: Learning Theory, ML: Optimization, ML: Reinforcement Learning

Abstract

This work establishes the first framework of federated X-armed bandit, where different clients face heterogeneous local objective functions defined on the same domain and are required to collaboratively figure out the global optimum. We propose the first federated algorithm for such problems, named Fed-PNE. By utilizing the topological structure of the global objective inside the hierarchical partitioning and the weak smoothness property, our algorithm achieves sublinear cumulative regret with respect to both the number of clients and the evaluation budget. Meanwhile, it only requires logarithmic communications between the central server and clients, protecting the client privacy. Experimental results on synthetic functions and real datasets validate the advantages of Fed-PNE over various centralized and federated baseline algorithms.

Published

2024-03-24

How to Cite

Li, W., Song, Q., Honorio, J., & Lin, G. (2024). Federated X-armed Bandit. Proceedings of the AAAI Conference on Artificial Intelligence, 38(12), 13628-13636. https://doi.org/10.1609/aaai.v38i12.29267

Issue

Section

AAAI Technical Track on Machine Learning III