CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG

Authors

  • Boyi Deng University of Science and Technology of China
  • Wenjie Wang National University of Singapore
  • Fengbin Zhu National University of Singapore
  • Qifan Wang Meta AI
  • Fuli Feng University of Science and Technology of China

DOI:

https://doi.org/10.1609/aaai.v39i22.34547

Abstract

Retrieval-Augmented Generation (RAG) can alleviate hallucinations of Large Language Models (LLMs) by referencing external documents. However, misinformation in the external documents may mislead the LLMs' generation. To address this issue, we explore the task of "credibility-aware RAG", in which LLMs automatically adjust the influence of retrieved documents based on their credibility scores to counteract misinformation. To this end, we introduce a plug-and-play method named Credibility-aware Attention Modification (CrAM). CrAM identifies influential attention heads in LLMs and adjusts their attention weights based on the credibility of the documents, thereby reducing the impact of low-credibility documents. Experiments on Natural Questions and TriviaQA using Llama2-13B, Llama3-8B, and Qwen1.5-7B show that CrAM improves the RAG performance of LLMs against misinformation pollution by over 20%, even surpassing supervised fine-tuning methods.
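The idea of adjusting attention weights by document credibility can be illustrated with a small sketch. This is not the authors' implementation: the function name `reweight_attention`, the per-token document index layout, the convention that `-1` marks non-document tokens, and the choice to scale and then renormalize are all assumptions made for illustration. It handles one query position in one attention head; how heads are selected as "influential" is not covered here.

```python
from typing import List, Sequence


def reweight_attention(
    attn: Sequence[float],
    token_doc_ids: Sequence[int],
    doc_credibility: Sequence[float],
) -> List[float]:
    """Scale attention on document tokens by credibility, then renormalize.

    attn: attention weights of one query position in one head (sums to 1).
    token_doc_ids: for each key token, the index of the retrieved document
        it belongs to, or -1 for non-document tokens (hypothetical layout).
    doc_credibility: one score in [0, 1] per retrieved document.
    """
    if len(attn) != len(token_doc_ids):
        raise ValueError("attn and token_doc_ids must have the same length")

    # Non-document tokens (id -1) keep their weight; document tokens are
    # multiplied by the credibility score of their source document.
    scaled = [
        w * (doc_credibility[d] if d >= 0 else 1.0)
        for w, d in zip(attn, token_doc_ids)
    ]

    total = sum(scaled)
    if total == 0.0:
        # Every weight was suppressed; fall back to the original weights.
        return list(attn)

    # Renormalize so the modified weights again sum to 1.
    return [w / total for w in scaled]


# One question token, one token from a credible document (score 1.0),
# and one token from a non-credible document (score 0.0).
weights = reweight_attention([0.2, 0.4, 0.4], [-1, 0, 1], [1.0, 0.0])
```

In this example the token from the zero-credibility document ends up with no attention, and the remaining two weights are rescaled so that they sum to 1.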

Published

2025-04-11

How to Cite

Deng, B., Wang, W., Zhu, F., Wang, Q., & Feng, F. (2025). CrAM: Credibility-Aware Attention Modification in LLMs for Combating Misinformation in RAG. Proceedings of the AAAI Conference on Artificial Intelligence, 39(22), 23760–23768. https://doi.org/10.1609/aaai.v39i22.34547

Issue

Section

AAAI Technical Track on Natural Language Processing I