(1) Sun, Z.; Sarma, P.; Sethares, W.; Liang, Y. Learning Relationships Between Text, Audio, and Video via Deep Canonical Correlation for Multimodal Language Analysis. AAAI 2020, 34, 8992-8999.