Li, Zihan, Wei Sun, Jing Hu, Jianhua Yin, Xing Wang, Erwei Yin, and Jianlong Wu. “Self-Enhanced Image Clustering With Cross-Modal Semantic Consistency”. Proceedings of the AAAI Conference on Artificial Intelligence 40, no. 28 (March 14, 2026): 23364–23372. Accessed May 15, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/39506.