Federated X-armed Bandit
DOI:
https://doi.org/10.1609/aaai.v38i12.29267Keywords:
ML: Online Learning & Bandits, ML: Auto ML and Hyperparameter Tuning, ML: Learning Theory, ML: Optimization, ML: Reinforcement LearningAbstract
This work establishes the first framework of federated X-armed bandit, where different clients face heterogeneous local objective functions defined on the same domain and are required to collaboratively figure out the global optimum. We propose the first federated algorithm for such problems, named Fed-PNE. By utilizing the topological structure of the global objective inside the hierarchical partitioning and the weak smoothness property, our algorithm achieves sublinear cumulative regret with respect to both the number of clients and the evaluation budget. Meanwhile, it only requires logarithmic communications between the central server and clients, protecting the client privacy. Experimental results on synthetic functions and real datasets validate the advantages of Fed-PNE over various centralized and federated baseline algorithms.Downloads
Published
2024-03-24
How to Cite
Li, W., Song, Q., Honorio, J., & Lin, G. (2024). Federated X-armed Bandit. Proceedings of the AAAI Conference on Artificial Intelligence, 38(12), 13628-13636. https://doi.org/10.1609/aaai.v38i12.29267
Issue
Section
AAAI Technical Track on Machine Learning III