Naim, I., Song, Y., Liu, Q., Kautz, H., Luo, J., & Gildea, D. (2014). Unsupervised Alignment of Natural Language Instructions with Video Segments. Proceedings of the AAAI Conference on Artificial Intelligence, 28(1). https://doi.org/10.1609/aaai.v28i1.8939