RAGFort: Dual-Path Defense Against Proprietary Knowledge Base Extraction in Retrieval-Augmented Generation

Authors

  • Qinfeng Li Zhejiang University
  • Miao Pan Zhejiang University
  • Ke Xiong Zhejiang University
  • Ge Su Zhejiang University
  • Zhiqiang Shen Ant Group
  • Yan Liu Ant Group
  • Sun Bing Universal Identification Technology (Hangzhou) Co., Ltd.
  • Hao Peng Zhejiang Normal University
  • Xuhong Zhang Zhejiang University Ningbo Global Innovation Center, Zhejiang University

DOI:

https://doi.org/10.1609/aaai.v40i37.40432

Abstract

Retrieval-Augmented Generation (RAG) systems deployed over proprietary knowledge bases face growing threats from reconstruction attacks that aggregate model responses to replicate knowledge bases. Such attacks exploit both intra-class and inter-class paths—progressively extracting fine-grained knowledge within topics and diffusing it across semantically related ones, thereby enabling comprehensive extraction of the original knowledge base. However, existing defenses target only one path, leaving the other unprotected. We conduct a systematic exploration to assess the impact of protecting each path independently and find that joint protection is essential for effective defense. Based on this, we propose RAGFort, a structure-aware dual-module defense combining contrastive reindexing for inter-class isolation and constrained cascade generation for intra-class protection. Experiments across security, performance, and robustness confirm that RAGFort significantly reduces reconstruction success while preserving answer quality, offering the first comprehensive defense against knowledge base extraction attacks.

Downloads

Published

2026-03-14

How to Cite

Li, Q., Pan, M., Xiong, K., Su, G., Shen, Z., Liu, Y., … Zhang, X. (2026). RAGFort: Dual-Path Defense Against Proprietary Knowledge Base Extraction in Retrieval-Augmented Generation. Proceedings of the AAAI Conference on Artificial Intelligence, 40(37), 31654–31661. https://doi.org/10.1609/aaai.v40i37.40432

Issue

Section

AAAI Technical Track on Natural Language Processing II