Constraints-Guided Diffusion Reasoner for Neuro-Symbolic Learning

Authors

  • Xuan Zhang Fudan University Shanghai Innovation Institute
  • Zhijian Zhou Fudan University Shanghai Innovation Institute
  • Weidi Xu INFTECH
  • Yanting Miao University of Waterloo
  • Chao Qu Fudan University Shanghai Academy of Artificial Intelligence for Science
  • Yuan Qi Fudan University Shanghai Academy of Artificial Intelligence for Science

DOI:

https://doi.org/10.1609/aaai.v40i33.40074

Abstract

Enabling neural networks to learn complex logical constraints and fulfill symbolic reasoning is a critical challenge. Bridging this gap often requires guiding the neural network’s output distribution to move closer to the symbolic constraints. While diffusion models have shown remarkable generative capability across various domains, we employ the powerful architecture to perform neuro-symbolic learning and solve logical puzzles. Our diffusion-based pipeline adopts a two-stage training strategy: the first stage focuses on cultivating basic reasoning abilities, while the second emphasizes systematic learning of logical constraints. To impose hard constraints on neural outputs in the second stage, we formulate the diffusion reasoner as a Markov decision process and innovatively fine-tune it with an improved proximal policy optimization algorithm. We utilize a rule-based reward signal derived from the logical consistency of neural outputs and adopt a flexible strategy to optimize the diffusion reasoner's policy. We evaluate our methodology on some classical symbolic reasoning benchmarks, including Sudoku, Maze, pathfinding and preference learning. Experimental results demonstrate that our approach achieves outstanding accuracy and logical consistency among neural networks.

Downloads

Published

2026-03-14

How to Cite

Zhang, X., Zhou, Z., Xu, W., Miao, Y., Qu, C., & Qi, Y. (2026). Constraints-Guided Diffusion Reasoner for Neuro-Symbolic Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(33), 28446–28454. https://doi.org/10.1609/aaai.v40i33.40074

Issue

Section

AAAI Technical Track on Machine Learning X