Perceiving the Knowledge Boundary: Uncertainty-Guided Exploration and Imagination for World Models
DOI:
https://doi.org/10.1609/aaai.v40i28.39576Abstract
World-model-based reinforcement learning achieves high sample efficiency by learning from imagined rollouts. However, its success critically depends on the accuracy of the learned world model, which is prone to producing unrealistic or hallucinated rollouts when queried beyond its domain of competence. These flawed predictions can trap the agent in a vicious cycle: by misleading exploration toward implausible or uninformative regions, they degrade the quality of collected data, which in turn corrupts policy learning with inaccurate rollouts. To break this cycle, we introduce the notion of a knowledge boundary—the region within which the world model provides reliable predictions—and propose a unified framework that both identifies and leverages this boundary. Concretely, we approximate the boundary using model uncertainty, quantified via disagreement across an ensemble of lightweight predictors, which serves as a practical proxy. This uncertainty signal is used in two complementary ways: as an intrinsic reward to guide exploration toward under-explored yet learnable regions, and as a dynamic filter to exclude unreliable imagined rollouts from policy optimization. Extensive experiments across diverse benchmarks—including CARLA, DeepMind Control Suite, Atari, and MemoryMaze—demonstrate that our approach consistently outperforms prior state-of-the-art methods.Downloads
Published
2026-03-14
How to Cite
Liu, Z., Peng, P., Huang, Y., & Tian, Y. (2026). Perceiving the Knowledge Boundary: Uncertainty-Guided Exploration and Imagination for World Models. Proceedings of the AAAI Conference on Artificial Intelligence, 40(28), 23990–23998. https://doi.org/10.1609/aaai.v40i28.39576
Issue
Section
AAAI Technical Track on Machine Learning V