(1) Li, G.; Zhao, B.; Yang, J.; Sevilla-Lara, L. Mask2IV: Interaction-Centric Video Generation via Mask Trajectories. AAAI 2026, 40, 6091-6099.