Kim, Young-Jin, Min-Jun Kim, Kyunghwan An, Jinwoo Ahn, Jaeseok Kim, Yu-Jung Heo, Du-Seong Chang, and Eun-Sol Kim. “Structure-Aware Multimodal Sequential Learning for Visual Dialog”. Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 12 (March 24, 2024): 13193-13201. Accessed November 21, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/29219.