Exploring the Efficacy of Multi-Agent Reinforcement Learning for Autonomous Cyber Defence: A CAGE Challenge 4 Perspective

Mitchell Kiely; Metin Ahiskali; Etienne Borde; Benjamin Bowman; David Bowman; Dirk van Bruggen; KC Cowan; Prithviraj Dasgupta; Erich Devendorf; Ben Edwards; Alex Fitts; Sunny Fugate; Ryan Gabrys; Wayne Gould; H. Howie Huang; Jules Jacobs; Ryan Kerr; Isaiah J. King; Li Li; Luis Martinez; Christopher Moir; Craig Murphy; Olivia Naish; Claire Owens; Miranda Purchase; Ahmad Ridley; Adrian Taylor; Sara Farmer; William John Valentine; Yiyi Zhang

doi:10.1609/aaai.v39i28.35158

Authors

Mitchell Kiely Defence Science & Technology Group (DSTG), Australia.
Metin Ahiskali Army Combat Capabilities Development Command (DEVCOM), USA.
Etienne Borde University of Canterbury, New Zealand.
Benjamin Bowman Cybermonic, USA.
David Bowman Defence Science & Technology Group (DSTG), Australia.
Dirk van Bruggen Punch Cyber Analytics, USA.
KC Cowan Army Combat Capabilities Development Command (DEVCOM), USA.
Prithviraj Dasgupta Naval Research Laboratory (NRL), USA.
Erich Devendorf Air Force Research Laboratory (AFRL), USA.
Ben Edwards Defence Science Technology Laboratory (Dstl), United Kingdom.
Alex Fitts Punch Cyber Analytics, USA.
Sunny Fugate Naval Information Warfare Center Pacific, USA.
Ryan Gabrys Naval Information Warfare Center Pacific, USA.
Wayne Gould Defence Science Technology Laboratory (Dstl), United Kingdom.
H. Howie Huang Cybermonic, USA.
Jules Jacobs Cornell University, USA.
Ryan Kerr Defence Research and Development Canada (DRDC), Canada.
Isaiah J. King Cybermonic, USA.
Li Li Defence Research and Development Canada (DRDC), Canada.
Luis Martinez Naval Information Warfare Center Pacific, USA.
Christopher Moir Defence Science & Technology Group (DSTG), Australia.
Craig Murphy Defence Science Technology Laboratory (Dstl), United Kingdom.
Olivia Naish Defence Science Technology Laboratory (Dstl), United Kingdom.
Claire Owens Defence Science Technology Laboratory (Dstl), United Kingdom.
Miranda Purchase Defence Science Technology Laboratory (Dstl), United Kingdom.
Ahmad Ridley National Security Agency (NSA), USA.
Adrian Taylor Defence Research and Development Canada (DRDC), Canada.
Sara Farmer Defence Science Technology Laboratory (Dstl), United Kingdom.
William John Valentine University of Canterbury, New Zealand.
Yiyi Zhang Cornell University, USA.

DOI:

https://doi.org/10.1609/aaai.v39i28.35158

Abstract

As cyber threats become increasingly automated and sophisticated, novel solutions must be introduced to improve defence of enterprise networks. Deep Reinforcement Learning (DRL) has demonstrated potential in mitigating these advanced threats. Single DRL Agents have proven utility toward execution of autonomous cyber defence. Despite the success of employing single DRL Agents, this approach presents significant limitations, especially regarding scalability within large enterprise networks. An attractive alternative to the single agent approach is the use of Multi-Agent Reinforcement Learning (MARL). However, developing MARL agents is costly with few options for examining MARL cyber defence techniques against adversarial agents. This paper presents a MARL network security environment, the fourth iteration of the Cyber Autonomy Gym for Experimentation (CAGE) challenges. This challenge was specifically designed to test the efficacy of MARL algorithms in an enterprise network. Our work aims to evaluate the potential of MARL as a robust and scalable solution for autonomous network defence.

Exploring the Efficacy of Multi-Agent Reinforcement Learning for Autonomous Cyber Defence: A CAGE Challenge 4 Perspective

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information