[1]
G. Zhang, “CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation”, AAAI, vol. 40, no. 15, pp. 12430–12438, Mar. 2026.