On the Role of Server Momentum in Federated Learning

Authors

  • Jianhui Sun Computer Science, University of Virginia, VA, USA
  • Xidong Wu Electrical and Computer Engineering, University of Pittsburgh, PA, USA
  • Heng Huang Computer Science, University of Maryland College Park, MD, USA
  • Aidong Zhang Computer Science, University of Virginia, VA, USA

DOI:

https://doi.org/10.1609/aaai.v38i13.29439

Keywords:

ML: Distributed Machine Learning & Federated Learning, ML: Optimization

Abstract

Federated Averaging (FedAvg) is known to experience convergence issues when encountering significant clients system heterogeneity and data heterogeneity. Server momentum has been proposed as an effective mitigation. However, existing server momentum works are restrictive in the momentum formulation, do not properly schedule hyperparameters and focus only on system homogeneous settings, which leaves the role of server momentum still an under-explored problem. In this paper, we propose a general framework for server momentum, that (a) covers a large class of momentum schemes that are unexplored in federated learning (FL), (b) enables a popular stagewise hyperparameter scheduler, (c) allows heterogeneous and asynchronous local computing. We provide rigorous convergence analysis for the proposed framework. To our best knowledge, this is the first work that thoroughly analyzes the performances of server momentum with a hyperparameter scheduler and system heterogeneity. Extensive experiments validate the effectiveness of our proposed framework. Due to page limit, we leave all proofs to the full version https://arxiv.org/abs/2312.12670.

Downloads

Published

2024-03-24

How to Cite

Sun, J., Wu, X., Huang, H., & Zhang, A. (2024). On the Role of Server Momentum in Federated Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 38(13), 15164-15172. https://doi.org/10.1609/aaai.v38i13.29439

Issue

Section

AAAI Technical Track on Machine Learning IV