Revisiting Differentiable Structure Learning: Inconsistency of L1 Penalty and Beyond

Authors

  • Kaifeng Jin University of Illinois Urbana-Champaign
  • Ignavier Ng Carnegie Mellon University
  • Kun Zhang Carnegie Mellon University Mohamed bin Zayed University of Artificial Intelligence
  • Biwei Huang University of California San Diego

DOI:

https://doi.org/10.1609/aaai.v40i27.39398

Abstract

Recent advances in differentiable structure learning have framed the combinatorial problem of learning directed acyclic graphs as a continuous optimization problem. Various aspects, including data standardization, have been studied to identify factors that influence the empirical performance of these methods. In this work, we investigate critical limitations in differentiable structure learning methods, focusing on settings where the true structure can be identified up to Markov equivalence classes, particularly in the linear Gaussian case. While recent work highlighted potential non-convexity issues in this setting, we demonstrate and explain why the use of L1-penalized likelihood in such cases is fundamentally inconsistent, even if the global optimum of the optimization problem can be found. To resolve this limitation, we develop a hybrid differentiable structure learning method based on L0-penalized likelihood with hard acyclicity constraint, where the L0 penalty can be approximated by different techniques including Gumbel-Softmax. Specifically, we first estimate the underlying moral graph, and use it to restrict the search space of the optimization problem, which helps alleviate the non-convexity issue. Experimental results show that the proposed method enhances empirical performance both before and after data standardization, providing a more reliable path for future advancements in differentiable structure learning, especially for learning Markov equivalence classes.

Downloads

Published

2026-03-14

How to Cite

Jin, K., Ng, I., Zhang, K., & Huang, B. (2026). Revisiting Differentiable Structure Learning: Inconsistency of L1 Penalty and Beyond. Proceedings of the AAAI Conference on Artificial Intelligence, 40(27), 22399-22407. https://doi.org/10.1609/aaai.v40i27.39398

Issue

Section

AAAI Technical Track on Machine Learning IV