GENG, Shijie; GAO, Peng; CHATTERJEE, Moitreya; HORI, Chiori; LE ROUX, Jonathan; ZHANG, Yongfeng; LI, Hongsheng; CHERIAN, Anoop. Dynamic Graph Representation Learning for Video Dialog via Multi-Modal Shuffled Transformers. Proceedings of the AAAI Conference on Artificial Intelligence, [S. l.], v. 35, n. 2, p. 1415–1423, 2021. DOI: 10.1609/aaai.v35i2.16231. Disponível em: https://ojs.aaai.org/index.php/AAAI/article/view/16231. Acesso em: 21 jul. 2026.