MARLIN: Multi-Agent Reinforcement Learning for Incremental DAG Discovery

Authors

  • Dong Li, Department of Computer Science, Baylor University
  • Zhengzhang Chen, NEC Labs America
  • Xujiang Zhao, NEC Labs America
  • Linlin Yu, School of Computer and Cyber Sciences, Augusta University
  • Zhong Chen, School of Computing, Southern Illinois University
  • Yi He, Department of Data Science, The College of William and Mary
  • Haifeng Chen, NEC Labs America
  • Chen Zhao, Department of Computer Science, Baylor University

DOI: https://doi.org/10.1609/aaai.v40i27.39450

Abstract

Uncovering causal structures from observational data is crucial for understanding complex systems and making informed decisions. While reinforcement learning (RL) has shown promise in identifying these structures in the form of a directed acyclic graph (DAG), existing methods often lack efficiency, making them unsuitable for online applications. In this paper, we propose MARLIN, an efficient multi-agent RL-based approach for incremental DAG learning. MARLIN uses a DAG generation policy that maps a continuous real-valued space to the DAG space as an intra-batch strategy, then incorporates two RL agents—state-specific and state-invariant—to uncover causal relationships and integrates these agents into an incremental learning framework. Furthermore, the framework leverages a factored action space to enhance parallelization efficiency. Extensive experiments on synthetic and real datasets demonstrate that MARLIN outperforms state-of-the-art methods in terms of both efficiency and effectiveness.
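
The abstract does not spell out how a continuous real-valued space is mapped to the DAG space, so the following is only an illustrative sketch of one simple construction, not MARLIN's actual policy. It assumes a real vector split into d node potentials and d*d edge scores; an edge i -> j is kept only when node i has a strictly lower potential than node j and its score exceeds a threshold, which rules out cycles by construction. The names vector_to_dag and is_acyclic, the vector layout, and the threshold parameter are all hypothetical.

```python
from typing import List


def vector_to_dag(z: List[float], d: int, threshold: float = 0.0) -> List[List[int]]:
    """Map a real vector of length d + d*d to a d x d DAG adjacency matrix.

    Assumed layout (hypothetical): the first d entries are node potentials,
    the remaining d*d entries are row-major edge scores.
    """
    if len(z) != d + d * d:
        raise ValueError("expected a vector of length d + d*d")
    potentials = z[:d]
    scores = z[d:]
    adj = [[0] * d for _ in range(d)]
    for i in range(d):
        for j in range(d):
            # Edges only run from lower to strictly higher potential,
            # so no directed cycle can ever form.
            if potentials[i] < potentials[j] and scores[i * d + j] > threshold:
                adj[i][j] = 1
    return adj


def is_acyclic(adj: List[List[int]]) -> bool:
    """Check acyclicity with Kahn's algorithm (repeatedly remove zero in-degree nodes)."""
    d = len(adj)
    indegree = [sum(adj[i][j] for i in range(d)) for j in range(d)]
    ready = [j for j in range(d) if indegree[j] == 0]
    removed = 0
    while ready:
        u = ready.pop()
        removed += 1
        for v in range(d):
            if adj[u][v]:
                indegree[v] -= 1
                if indegree[v] == 0:
                    ready.append(v)
    return removed == d


if __name__ == "__main__":
    # Three nodes with potentials 0.1, 0.5, 0.3 and all edge scores above threshold.
    z = [0.1, 0.5, 0.3] + [1.0] * 9
    graph = vector_to_dag(z, d=3)
    print(graph)
    print(is_acyclic(graph))
```

Because every real vector of the right length yields a valid DAG under this mapping, a policy could in principle search over unconstrained real values without an explicit acyclicity penalty; whether MARLIN works this way is not stated in the abstract.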

Published

2026-03-14

How to Cite

Li, D., Chen, Z., Zhao, X., Yu, L., Chen, Z., He, Y., … Zhao, C. (2026). MARLIN: Multi-Agent Reinforcement Learning for Incremental DAG Discovery. Proceedings of the AAAI Conference on Artificial Intelligence, 40(27), 22869–22877. https://doi.org/10.1609/aaai.v40i27.39450

Section

AAAI Technical Track on Machine Learning IV