Wang, B., Xu, Y., Han, Y., & Hong, R. (2018). Movie Question Answering: Remembering the Textual Cues for Layered Visual Contents. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1). https://doi.org/10.1609/aaai.v32i1.12253