Towards Robust Multi-Agent Reinforcement Learning

Authors

  • Aritra Mitra North Carolina State University

DOI:

https://doi.org/10.1609/aaaiss.v3i1.31222

Keywords:

Distributed Machine Learning & Federated Learning, Multi-Agent Reinforcement Learning, Temporal Difference Learning

Abstract

Stochastic gradient descent (SGD) is at the heart of large-scale distributed machine learning paradigms such as federated learning (FL). In these applications, the task of training high-dimensional weight vectors is distributed among several workers that exchange information over networks of limited bandwidth. While parallelization at such an immense scale helps to reduce the computational burden, it creates several other challenges: delays, asynchrony, and most importantly, a significant communication bottleneck. The popularity and success of SGD can be attributed in no small part to the fact that it is extremely robust to such deviations from ideal operating conditions. Inspired by these findings, we ask: Are common reinforcement learning (RL) algorithms also robust to similarly structured perturbations? Perhaps surprisingly, despite the recent surge of interest in multi-agent/federated RL, almost nothing is known about the above question. This paper collects some of our recent results in filling this void.

Downloads

Published

2024-05-20