Adaptive Hallucination Alleviation in Multimodal Large Language Models: From Strategic Data Selection to Severity-Guided Training

Authors

  • Yuanyi Xu Fudan University
  • Xiangru Zhu Fudan University
  • Sihang Jiang Fudan University
  • Zhixu Li Renmin University of China
  • Bei Yang Alibaba Group
  • Xiaoxiao Xu Alibaba Group
  • Yanghua Xiao Fudan University
  • Wei Wang Fudan University

DOI:

https://doi.org/10.1609/aaai.v40i32.39955

Abstract

Multimodal Large Language Models (MLLMs) have recently achieved strong performance across a variety of multimodal tasks. However, they still suffer from various forms of hallucination, which hinder their practical deployment. Prior approaches often struggle to efficiently construct high-quality hallucination-related samples and to process them in a fine-grained manner, resulting in limited effectiveness in hallucination alleviation. To address this issue, we propose a data sampling strategy that selects samples better suited for hallucination-oriented training, thereby enhancing training effectiveness. In addition, we introduce a quantitative method for measuring hallucination severity and assign individualized weights to training samples accordingly. Building on this, we present Hallucination-Differentiated Direct Preference Optimization (HD-DPO), a novel preference optimization framework. During fine-tuning, HD-DPO incorporates these weights into both the formulation of customized loss functions and the modulation of localized visual attention, enabling fine-grained optimization. Experimental results demonstrate that our method outperforms existing fine-tuning strategies across multiple benchmarks and generalizes well to diverse MLLM architectures, effectively reducing hallucination rates and enhancing overall model performance.

Downloads

Published

2026-03-14

How to Cite

Xu, Y., Zhu, X., Jiang, S., Li, Z., Yang, B., Xu, X., … Wang, W. (2026). Adaptive Hallucination Alleviation in Multimodal Large Language Models: From Strategic Data Selection to Severity-Guided Training. Proceedings of the AAAI Conference on Artificial Intelligence, 40(32), 27377–27385. https://doi.org/10.1609/aaai.v40i32.39955

Issue

Section

AAAI Technical Track on Machine Learning IX