Knowledge Boundary Discovery for Large Language Models
DOI:
https://doi.org/10.1609/aaai.v40i40.40670
Abstract
We propose Knowledge Boundary Discovery (KBD), a reinforcement-learning-based framework for exploring the knowledge boundaries of Large Language Models (LLMs). We define the knowledge boundary by automatically generating two types of questions: (i) those the LLM can confidently answer (within the knowledge boundary) and (ii) those it cannot (beyond the knowledge boundary). Iteratively exploring and exploiting the LLM's responses to find its knowledge boundaries is challenging because of hallucination. To find the knowledge boundaries of an LLM, the agent interacts with it in a setting modeled as the exploration of a partially observable environment. The agent generates a progressive question as its action, uses entropy reduction as its reward, receives the LLM's response as its observation, and updates its belief state. We demonstrate that KBD detects the knowledge boundaries of LLMs by automatically finding a set of non-trivial answerable and unanswerable questions. We validate KBD by comparing its generated knowledge boundaries with manually crafted LLM benchmark datasets. Experiments show that the KBD-generated question set is comparable to the human-generated datasets. Our approach offers a new way to evaluate LLMs.
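The action, reward, observation, and belief-update loop described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes a fixed pool of candidate questions instead of generated ones, replaces the learned policy with a simple "ask the most uncertain question" heuristic, and models hallucination as a noisy observation with assumed rates `hit` and `false_alarm`. All function and parameter names (`binary_entropy`, `update_belief`, `explore`, `respond`) are hypothetical.

```python
import math
from typing import Callable, Dict, List, Tuple


def binary_entropy(p: float) -> float:
    """Shannon entropy in bits of a Bernoulli(p) belief."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))


def update_belief(prior: float, answered: bool,
                  hit: float = 0.9, false_alarm: float = 0.2) -> float:
    """Bayes update of P(question lies within the knowledge boundary).

    Assumed noise model (not from the paper):
      hit         = P(confident answer | within boundary)
      false_alarm = P(confident answer | beyond boundary), i.e. hallucination
    """
    like_in = hit if answered else 1.0 - hit
    like_out = false_alarm if answered else 1.0 - false_alarm
    return like_in * prior / (like_in * prior + like_out * (1.0 - prior))


def explore(questions: List[str],
            respond: Callable[[str], bool],
            steps: int) -> Tuple[Dict[str, float], List[Tuple[str, float]]]:
    """Run a toy exploration loop and return final beliefs and (question, reward) pairs."""
    # Start maximally uncertain about every candidate question.
    belief = {q: 0.5 for q in questions}
    history: List[Tuple[str, float]] = []
    for _ in range(steps):
        # Action: ask the question whose status is currently most uncertain.
        q = max(questions, key=lambda x: binary_entropy(belief[x]))
        before = binary_entropy(belief[q])
        # Observation: whether the model under test answered confidently.
        answered = respond(q)
        # Belief update, then reward as the resulting entropy reduction.
        belief[q] = update_belief(belief[q], answered)
        reward = before - binary_entropy(belief[q])
        history.append((q, reward))
    return belief, history
```

In this sketch `respond` stands in for the LLM under test. Passing a stub such as `lambda q: q in known_questions` lets the beliefs separate the answerable from the unanswerable candidates after a few rounds, which is the toy analogue of locating the boundary.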
Published
2026-03-14
How to Cite
Wang, Z., & Lu, Z. (2026). Knowledge Boundary Discovery for Large Language Models. Proceedings of the AAAI Conference on Artificial Intelligence, 40(40), 33791–33799. https://doi.org/10.1609/aaai.v40i40.40670
Issue
Section
AAAI Technical Track on Natural Language Processing V