(1)
Wang, Z.; Li, D.; Chen, Y.; Shi, Y.; Bai, L.; Yu, T.; Fu, Y. One-Step Generative Policies With Q-Learning: A Reformulation of MeanFlow. AAAI 2026, 40, 26751-26759.