(1) Xu, X.; Bu, P.; Wang, Y.; Karlsson, B. F.; Wang, Z.; Song, T.; Zhu, Q.; Song, J.; Ding, Z.; Zheng, B. DeepPhy: Benchmarking Agentic VLMs on Physical Reasoning. AAAI 2026, 40, 34160-34168.