Exploring the Efficacy of Multi-Agent Reinforcement Learning for Autonomous Cyber Defence: A CAGE Challenge 4 Perspective

Authors

  • Mitchell Kiely Defence Science & Technology Group (DSTG), Australia.
  • Metin Ahiskali Army Combat Capabilities Development Command (DEVCOM), USA.
  • Etienne Borde University of Canterbury, New Zealand.
  • Benjamin Bowman Cybermonic, USA.
  • David Bowman Defence Science & Technology Group (DSTG), Australia.
  • Dirk van Bruggen Punch Cyber Analytics, USA.
  • KC Cowan Army Combat Capabilities Development Command (DEVCOM), USA.
  • Prithviraj Dasgupta Naval Research Laboratory (NRL), USA.
  • Erich Devendorf Air Force Research Laboratory (AFRL), USA.
  • Ben Edwards Defence Science Technology Laboratory (Dstl), United Kingdom.
  • Alex Fitts Punch Cyber Analytics, USA.
  • Sunny Fugate Naval Information Warfare Center Pacific, USA.
  • Ryan Gabrys Naval Information Warfare Center Pacific, USA.
  • Wayne Gould Defence Science Technology Laboratory (Dstl), United Kingdom.
  • H. Howie Huang Cybermonic, USA.
  • Jules Jacobs Cornell University, USA.
  • Ryan Kerr Defence Research and Development Canada (DRDC), Canada.
  • Isaiah J. King Cybermonic, USA.
  • Li Li Defence Research and Development Canada (DRDC), Canada.
  • Luis Martinez Naval Information Warfare Center Pacific, USA.
  • Christopher Moir Defence Science & Technology Group (DSTG), Australia.
  • Craig Murphy Defence Science Technology Laboratory (Dstl), United Kingdom.
  • Olivia Naish Defence Science Technology Laboratory (Dstl), United Kingdom.
  • Claire Owens Defence Science Technology Laboratory (Dstl), United Kingdom.
  • Miranda Purchase Defence Science Technology Laboratory (Dstl), United Kingdom.
  • Ahmad Ridley National Security Agency (NSA), USA.
  • Adrian Taylor Defence Research and Development Canada (DRDC), Canada.
  • Sara Farmer Defence Science Technology Laboratory (Dstl), United Kingdom.
  • William John Valentine University of Canterbury, New Zealand.
  • Yiyi Zhang Cornell University, USA.

DOI:

https://doi.org/10.1609/aaai.v39i28.35158

Abstract

As cyber threats become increasingly automated and sophisticated, novel solutions must be introduced to improve defence of enterprise networks. Deep Reinforcement Learning (DRL) has demonstrated potential in mitigating these advanced threats. Single DRL Agents have proven utility toward execution of autonomous cyber defence. Despite the success of employing single DRL Agents, this approach presents significant limitations, especially regarding scalability within large enterprise networks. An attractive alternative to the single agent approach is the use of Multi-Agent Reinforcement Learning (MARL). However, developing MARL agents is costly with few options for examining MARL cyber defence techniques against adversarial agents. This paper presents a MARL network security environment, the fourth iteration of the Cyber Autonomy Gym for Experimentation (CAGE) challenges. This challenge was specifically designed to test the efficacy of MARL algorithms in an enterprise network. Our work aims to evaluate the potential of MARL as a robust and scalable solution for autonomous network defence.

Downloads

Published

2025-04-11

How to Cite

Kiely, M., Ahiskali, M., Borde, E., Bowman, B., Bowman, D., van Bruggen, D., … Zhang, Y. (2025). Exploring the Efficacy of Multi-Agent Reinforcement Learning for Autonomous Cyber Defence: A CAGE Challenge 4 Perspective. Proceedings of the AAAI Conference on Artificial Intelligence, 39(28), 28907–28913. https://doi.org/10.1609/aaai.v39i28.35158

Issue

Section

IAAI Technical Track on Deployed Innovative Tools for Enabling AI Applications