Wang, Zeyuan, Da Li, Yulin Chen, Ye Shi, Liang Bai, Tianyuan Yu, and Yanwei Fu. “One-Step Generative Policies With Q-Learning: A Reformulation of MeanFlow”. Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 31 (March 14, 2026): 26751–26759. Accessed May 16, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/39885.