Xu, H. (2019) “Multilevel Language and Vision Integration for Text-to-Clip Retrieval”, Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), pp. 9062–9069. doi: 10.1609/aaai.v33i01.33019062.