Naim, I., Y. Song, Q. Liu, H. Kautz, J. Luo, and D. Gildea. “Unsupervised Alignment of Natural Language Instructions With Video Segments”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 28, no. 1, June 2014, doi:10.1609/aaai.v28i1.8939.