Think Then Rewrite: Reasoning Enhanced Query Rewriting for Domain Specific Retrieval

Authors

  • Ang Li Zhejiang University
  • Yufei Shi Hong Kong Polytechnic University
  • Yuxuan Si Zhejiang University
  • Yiquan Wu Zhejiang University
  • Ming Cai Zhejiang University
  • Xu Tan Zhejiang University of Science and Technology
  • Yi Wang Chongqing Ant Consumer Finance Co,. Ltd
  • Changlong Sun Zhejiang University
  • Xiaozhong Liu Worcester Polytechnic Institute
  • Kun Kuang Zhejiang University

DOI:

https://doi.org/10.1609/aaai.v40i18.38527

Abstract

Query rewriting is a crucial task for improving retrieval, especially in professional domains such as law and medicine, where user queries are often underspecified and ambiguous. While large language models (LLMs) offer strong understanding and generation capabilities, existing LLM-based approaches reduce the task to text transformation or expansion, neglecting reasoning to disambiguate queries, which fails to bridge the cognitive gap between user queries and specialized documents. In this paper, we propose Think-Then-Rewrite (TTR), a reinforcement learning based framework that unleashes LLMs' reasoning ability for domain-specific query rewriting. TTR introduces a contrastive mutual information reward to encourage the LLM to generate reasoning processes that effectively distinguish confusing distractors. To boost early-stage training, TTR also constructs golden query rewrites as off‑policy data, providing strong guidance for RL learning. A mixed-policy optimization then combines on-policy and off-policy signals, ensuring both effectiveness and stability. Extensive experiments on legal and medical retrieval benchmarks demonstrate that TTR achieves state-of-the-art performance.

Published

2026-03-14

How to Cite

Li, A., Shi, Y., Si, Y., Wu, Y., Cai, M., Tan, X., … Kuang, K. (2026). Think Then Rewrite: Reasoning Enhanced Query Rewriting for Domain Specific Retrieval. Proceedings of the AAAI Conference on Artificial Intelligence, 40(18), 15045–15053. https://doi.org/10.1609/aaai.v40i18.38527

Issue

Section

AAAI Technical Track on Data Mining & Knowledge Management II