[1]

Pan, M., Gan, W., Chen, J., Zhang, W., Bing, S., Yin, J. and Zhang, X. 2026. Ground What You See: Hallucination-Resistant MLLMs via Caption Feedback, Diversity-Aware Sampling, and Conflict Regularization. Proceedings of the AAAI Conference on Artificial Intelligence. 40, 10 (Mar. 2026), 8242-8250. DOI:https://doi.org/10.1609/aaai.v40i10.37772.