Zhou, Zhiyuan, Yanrong Guo, and Shijie Hao. “Voices, Faces, and Feelings: Multi-Modal Emotion-Cognition Captioning for Mental Health Understanding.” Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 3 (March 14, 2026): 2263–2271. Accessed May 27, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/37210.