Cheng, Z. (2025) “CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models”, Proceedings of the AAAI Conference on Artificial Intelligence, 39(22), pp. 23678–23686. doi: 10.1609/aaai.v39i22.34538.