Zhao, S., Ma, Y., Gu, Y., Yang, J., Xing, T., Xu, P., Hu, R., Chai, H. and Keutzer, K. (2020) “An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos”, Proceedings of the AAAI Conference on Artificial Intelligence, 34(01), pp. 303-311. doi: 10.1609/aaai.v34i01.5364.