Zhang, L., Zhang, X., & Pan, J. (2022). Hierarchical Cross-Modality Semantic Correlation Learning Model for Multimodal Summarization. Proceedings of the AAAI Conference on Artificial Intelligence, 36(10), 11676–11684. https://doi.org/10.1609/aaai.v36i10.21422