Zhao, J., Y. Huang, and F. Lu. “Learning Procedural-Aware Video Representations Through State-Grounded Hierarchy Unfolding”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 16, Mar. 2026, pp. 13172-80, doi:10.1609/aaai.v40i16.38318.