Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users
DOI:
https://doi.org/10.1609/aaai.v38i18.30045Keywords:
RU: Sequential Decision Making, ML: Online Learning & BanditsAbstract
We study the problem of federated contextual combinatorial cascading bandits, where agents collaborate under the coordination of a central server to provide tailored recommendations to users. Existing works consider either a synchronous framework, necessitating full agent participation and global synchronization, or assume user homogeneity with identical behaviors. We overcome these limitations by considering (1) federated agents operating in an asynchronous communication paradigm, where no mandatory synchronization is required and all agents communicate independently with the server, (2) heterogeneous user behaviors, where users can be stratified into latent user clusters, each exhibiting distinct preferences. For this setting, we propose a UCB-type algorithm with delicate communication protocols. Through theoretical analysis, we give sub-linear regret bounds on par with those achieved in the synchronous framework, while incurring only logarithmic communication costs. Empirical evaluation on synthetic and real-world datasets validates our algorithm's superior performance in terms of regrets and communication costs.Downloads
Published
2024-03-24
How to Cite
Yang, H., Liu, X., Wang, Z., Xie, H., Lui, J. C. S., Lian, D., & Chen, E. (2024). Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users. Proceedings of the AAAI Conference on Artificial Intelligence, 38(18), 20596-20603. https://doi.org/10.1609/aaai.v38i18.30045
Issue
Section
AAAI Technical Track on Reasoning under Uncertainty