Huang, Yufeng, Jiji Tang, Zhuo Chen, Rongsheng Zhang, Xinfeng Zhang, Weijie Chen, Zeng Zhao, et al. 2024. “Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-Modal Structured Representations”. Proceedings of the AAAI Conference on Artificial Intelligence 38 (3):2417-25. https://doi.org/10.1609/aaai.v38i3.28017.