(1)
Xu, H.; He, K.; Plummer, B. A.; Sigal, L.; Sclaroff, S.; Saenko, K. Multilevel Language and Vision Integration for Text-to-Clip Retrieval. AAAI 2019, 33, 9062-9069.