Loss-Guided Auxiliary Agents for Overcoming Mode Collapse in GFlowNets

Authors

  • Idriss Malek Mohamed bin Zayed University of Artificial Intelligence
  • Aya Laajil Mohamed bin Zayed University of Artificial Intelligence
  • Abhijith Sharma Mohamed bin Zayed University of Artificial Intelligence
  • Eric Moulines Mohamed bin Zayed University of Artificial Intelligence
  • Salem Lahlou Mohamed bin Zayed University of Artificial Intelligence

DOI:

https://doi.org/10.1609/aaai.v40i29.39613

Abstract

Although Generative Flow Networks (GFlowNets) are designed to capture multiple modes of a reward function, they often suffer from mode collapse in practice, getting trapped in early-discovered modes and requiring prolonged training to find diverse solutions. Existing exploration techniques often rely on heuristic novelty signals. We propose Loss-Guided GFlowNets (LGGFN), a novel approach where an auxiliary GFlowNet's exploration is directly driven by the main GFlowNet's training loss. By prioritizing trajectories where the main model exhibits high loss, LGGFN focuses sampling on poorly understood regions of the state space. This targeted exploration significantly accelerates the discovery of diverse, high-reward samples. Empirically, across diverse benchmarks including grid environments, structured sequence generation, Bayesian structure learning, and biological sequence design, LGGFN consistently outperforms baselines in exploration efficiency and sample diversity. For instance, on a challenging sequence generation task, it discovered over 40 times more unique valid modes while simultaneously reducing the exploration error metric by approximately 99%.

Downloads

Published

2026-03-14

How to Cite

Malek, I., Laajil, A., Sharma, A., Moulines, E., & Lahlou, S. (2026). Loss-Guided Auxiliary Agents for Overcoming Mode Collapse in GFlowNets. Proceedings of the AAAI Conference on Artificial Intelligence, 40(29), 24326-24334. https://doi.org/10.1609/aaai.v40i29.39613

Issue

Section

AAAI Technical Track on Machine Learning VI