Zhang, G., Zhong, T., Xia, Y., Liu, M., Yu, Z., Li, H., … Jiang, H. (2026). CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(15), 12430–12438. https://doi.org/10.1609/aaai.v40i15.38236