[1]
Z. Zhou, Y. Guo, and S. Hao, “Voices, Faces, and Feelings: Multi-modal Emotion-Cognition Captioning for Mental Health Understanding”, AAAI, vol. 40, no. 3, pp. 2263–2271, Mar. 2026.