[1]
L. Zhang, X. Zhang, and J. Pan, “Hierarchical Cross-Modality Semantic Correlation Learning Model for Multimodal Summarization”, AAAI, vol. 36, no. 10, pp. 11676-11684, Jun. 2022.