(1)
Pan, M.; Gan, W.; Chen, J.; Zhang, W.; Bing, S.; Yin, J.; Zhang, X. Ground What You See: Hallucination-Resistant MLLMs via Caption Feedback, Diversity-Aware Sampling, and Conflict Regularization. AAAI 2026, 40, 8242-8250.