(1)
Ilaslan, M. F.; Köksal, A.; Lin, K. Q.; Satar, B.; Shou, M. Z.; Xu, Q. VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting. AAAI 2025, 39, 3886-3894.