Ilaslan, Muhammet Furkan, Ali Köksal, Kevin Qinghong Lin, Burak Satar, Mike Zheng Shou, and Qianli Xu. 2025. “VG-TVP: Multimodal Procedural Planning via Visually Grounded Text-Video Prompting”. Proceedings of the AAAI Conference on Artificial Intelligence 39 (4):3886-94. https://doi.org/10.1609/aaai.v39i4.32406.