Towards Fault-Tolerant Federated and Distributed Machine Learning

Authors

  • Sanmi Koyejo Stanford University

DOI:

https://doi.org/10.1609/aaaiss.v3i1.31220

Keywords:

ML: Distributed Machine Learning & Federated Learning, Collaborative Learning, Learning On The Edge

Abstract

Machine learning (ML) models are routinely trained and deployed among distributed devices, e.g., learning with geo-distributed data centers and federated learning with mobile devices. Such shared computing platforms are susceptible to hardware, software, communication errors, and security concerns. This talk will outline some of the threat models in distributed learning, along with robust learning methods proposed to augment the fault tolerance of distributed machine learning, showing both theoretical and empirical evidence of robustness to benign and adversarial attacks.

Downloads

Published

2024-05-20