(1) Wang, Z.; You, H.; Li, L. H.; Zareian, A.; Park, S.; Liang, Y.; Chang, K.-W.; Chang, S.-F. SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning. AAAI 2022, 36, 5914-5922.