Pan, M., W. Gan, J. Chen, W. Zhang, S. Bing, J. Yin, and X. Zhang. “Ground What You See: Hallucination-Resistant MLLMs via Caption Feedback, Diversity-Aware Sampling, and Conflict Regularization”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, no. 10, Mar. 2026, pp. 8242-50, doi:10.1609/aaai.v40i10.37772.