Xu, H., K. He, B. A. Plummer, L. Sigal, S. Sclaroff, and K. Saenko. “Multilevel Language and Vision Integration for Text-to-Clip Retrieval”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, no. 01, July 2019, pp. 9062-9, doi:10.1609/aaai.v33i01.33019062.