Ling, Run, Ke Cao, Jian Lu, Ao Ma, Haowei Liu, Runze He, Changwei Wang, et al. “MoFu: Scale-Aware Modulation and Fourier Fusion for Multi-Subject Video Generation”. Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 9 (March 14, 2026): 7033–7041. Accessed May 25, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/37638.