Aware First, Think Less: Dynamic Boundary Self-Awareness Drives Significant Gains in Reasoning Efficiency in Large Language Models

Authors

  • Qiguang Chen Harbin Institute of Technology
  • Dengyun Peng Harbin Institute of Technology
  • Jinhao Liu Harbin Institute of Technology
  • Huikang Su Harbin Institute of Technology
  • Jiannan Guan Harbin Institute of Technology
  • Libo Qin Central South University
  • Wanxiang Che Harbin Institute of Technology

DOI:

https://doi.org/10.1609/aaai.v40i36.40277

Abstract

Recent advancements in large language models (LLMs) have greatly improved their ability to perform complex reasoning tasks through Long Chain-of-Thought (CoT). However, this approach often results in substantial redundancy, impairing computational efficiency and causing significant delays in real-time applications. To improve efficiency, current methods often rely on human-defined difficulty priors, which do not align with the LLM's self-awared difficulty, leading to inefficiencies. In this paper, we introduce the Dynamic Reasoning-Boundary Self-Awareness Framework (DR. SAF), which enables LLMs to dynamically assess and adjust their reasoning depth in response to problem complexity. DR. SAF integrates three key components: Boundary Self-Awareness Alignment, Adaptive Reward Management, and a Boundary Preservation Mechanism. These components allow models to optimize their reasoning processes, balancing efficiency and accuracy without compromising performance. Our experimental results demonstrate that DR. SAF achieves a 49.27% reduction in total response tokens with minimal loss in accuracy. The framework also delivers a 6.59x gain in token efficiency and a 5x reduction in training time, making it well-suited to resource-limited settings. During extreme training, DR. SAF can even surpass traditional instruction-based models in token efficiency with more than 16% accuracy improvement.

Downloads

Published

2026-03-14

How to Cite

Chen, Q., Peng, D., Liu, J., Su, H., Guan, J., Qin, L., & Che, W. (2026). Aware First, Think Less: Dynamic Boundary Self-Awareness Drives Significant Gains in Reasoning Efficiency in Large Language Models. Proceedings of the AAAI Conference on Artificial Intelligence, 40(36), 30261–30269. https://doi.org/10.1609/aaai.v40i36.40277

Issue

Section

AAAI Technical Track on Natural Language Processing I