Federated X-armed Bandit

Wenjie Li; Qifan Song; Jean Honorio; Guang Lin

doi:10.1609/aaai.v38i12.29267

Authors

Wenjie Li Department of Statistics, Purdue University
Qifan Song Department of Statistics, Purdue University
Jean Honorio School of Computing and Information Systems, The University of Melbourne
Guang Lin Departments of Mathematics and School of Mechanical Engineering, Purdue University

DOI:

https://doi.org/10.1609/aaai.v38i12.29267

Keywords:

ML: Online Learning & Bandits, ML: Auto ML and Hyperparameter Tuning, ML: Learning Theory, ML: Optimization, ML: Reinforcement Learning

Abstract

This work establishes the first framework of federated X-armed bandit, where different clients face heterogeneous local objective functions defined on the same domain and are required to collaboratively figure out the global optimum. We propose the first federated algorithm for such problems, named Fed-PNE. By utilizing the topological structure of the global objective inside the hierarchical partitioning and the weak smoothness property, our algorithm achieves sublinear cumulative regret with respect to both the number of clients and the evaluation budget. Meanwhile, it only requires logarithmic communications between the central server and clients, protecting the client privacy. Experimental results on synthetic functions and real datasets validate the advantages of Fed-PNE over various centralized and federated baseline algorithms.

Federated X-armed Bandit

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription