TFRank: Think-Free Reasoning Enables Practical Pointwise LLM Ranking

Authors

  • Yongqi Fan East China University of Science and Technology Tencent
  • Xiaoyang Chen University of Chinese Academy of Sciences Chinese Information Processing Laboratory, Institute of Software, Chinese Academy of Sciences Tencent
  • Dezhi Ye Tencent
  • Jie Liu Tencent
  • Haijin Liang Tencent
  • Jin Ma Tencent
  • Ben He University of Chinese Academy of Sciences Chinese Information Processing Laboratory, Institute of Software, Chinese Academy of Sciences
  • Yingfei Sun University of Chinese Academy of Sciences
  • Tong Ruan East China University of Science and Technology

DOI:

https://doi.org/10.1609/aaai.v40i25.39244

Abstract

Reasoning-intensive ranking models built on Large Language Models (LLMs) have made notable progress. However, existing approaches often rely on large-scale LLMs and explicit Chain-of-Thought (CoT) reasoning, resulting in high computational cost and latency that limit real-world use. To address this, we propose TFRank, an efficient pointwise reasoning ranker based on small-scale LLMs. To improve ranking performance, TFRank effectively integrates CoT data, fine-grained score supervision, and multi-task training. Furthermore, it achieves an efficient "Think-Free" reasoning capability by employing a "think-mode switch" and pointwise format constraints. Specifically, this allows the model to leverage explicit reasoning during training while delivering precise relevance scores for complex queries at inference without generating any reasoning chains. Experiments show that TFRank achieves performance comparable to models with four times more parameters on the BRIGHT benchmark, and demonstrates strong competitiveness on the BEIR benchmark. Further analysis shows that TFRank achieves an effective balance between performance and efficiency, providing a practical solution for integrating advanced reasoning into real-world systems.

Published

2026-03-14

How to Cite

Fan, Y., Chen, X., Ye, D., Liu, J., Liang, H., Ma, J., … Ruan, T. (2026). TFRank: Think-Free Reasoning Enables Practical Pointwise LLM Ranking. Proceedings of the AAAI Conference on Artificial Intelligence, 40(25), 21020–21028. https://doi.org/10.1609/aaai.v40i25.39244

Issue

Section

AAAI Technical Track on Machine Learning II