Yang, L. (2026) “ProBench: Benchmarking GUI Agents with Accurate Process Information”, Proceedings of the AAAI Conference on Artificial Intelligence, 40(32), pp. 27547–27555. doi: 10.1609/aaai.v40i32.39974.