Gupta, S., & Mooney, R. (2010). Using Closed Captions as Supervision for Video Activity Recognition. Proceedings of the AAAI Conference on Artificial Intelligence, 24(1), 1083–1088. https://doi.org/10.1609/aaai.v24i1.7738