High Dimensional Distributed Gradient Descent with Arbitrary Number of Byzantine Attackers

Authors

  • Wenyu Liu College of Computer and Cyber Security, Fujian Normal University
  • Tianqiang Huang College of Computer and Cyber Security, Fujian Normal University
  • Pengfei Zhang Anhui University of Science and Technology
  • Zong Ke National University of Singapore
  • Minghui Min China University of Mining Technology
  • Puning Zhao School of Cyber Science and Technology, Sun Yat-sen University

DOI:

https://doi.org/10.1609/aaai.v40i28.39560

Abstract

Adversarial attacks pose a major challenge to distributed learning systems, prompting the development of numerous robust learning methods. However, most existing approaches suffer from the curse of dimensionality, i.e. the error increases with the number of model parameters. In this paper, we make a progress towards high dimensional problems, under arbitrary number of Byzantine attackers. The cornerstone of our design is a direct high dimensional semi-verified mean estimation method. The idea is to identify a subspace with large variance. The components of the mean value perpendicular to this subspace are estimated using corrupted gradient vectors uploaded from worker machines, while the components within this subspace are estimated using auxiliary dataset. As a result, a combination of large corrupted dataset and small clean dataset yields significantly better performance than using them separately. We then apply this method as the aggregator for distributed learning problems. The theoretical analysis shows that compared with existing solutions, our method gets rid of sqrt{d} dependence on the dimensionality, and achieves minimax optimal statistical rates. Numerical results validate our theory as well as the effectiveness of the proposed method.

Downloads

Published

2026-03-14

How to Cite

Liu, W., Huang, T., Zhang, P., Ke, Z., Min, M., & Zhao, P. (2026). High Dimensional Distributed Gradient Descent with Arbitrary Number of Byzantine Attackers. Proceedings of the AAAI Conference on Artificial Intelligence, 40(28), 23846–23854. https://doi.org/10.1609/aaai.v40i28.39560

Issue

Section

AAAI Technical Track on Machine Learning V