Zhou, Z., Guo, Y., & Hao, S. (2026). Voices, Faces, and Feelings: Multi-modal Emotion-Cognition Captioning for Mental Health Understanding. Proceedings of the AAAI Conference on Artificial Intelligence, 40(3), 2263–2271. https://doi.org/10.1609/aaai.v40i3.37210