Yang, L., Wang, Z., Tang, X., Zhou, S., Chen, D., Jiang, W., & Li, Y. (2026). ProBench: Benchmarking GUI Agents with Accurate Process Information. Proceedings of the AAAI Conference on Artificial Intelligence, 40(32), 27547–27555. https://doi.org/10.1609/aaai.v40i32.39974