Specification-Guided Reinforcement Learning
DOI:
https://doi.org/10.1609/aaai.v40i48.42142Abstract
While Reinforcement Learning (RL) has demonstrated remarkable success in solving complex sequential decision-making problems, its application in real-world, safety-critical systems is hindered by its reliance on carefully engineered reward functions. Designing effective rewards is notoriously challenging and can lead to unintended or unsafe behaviors, a phenomenon known as reward hacking. Specification-guided RL has emerged as a principled alternative, leveraging formal methods to directly encode high-level objectives, safety requirements, and behavioral constraints. However, the practical utility of this approach is often limited by coarse or under-specified logical formulas and the computational challenge of enforcing safety at scale. This thesis addresses these limitations by developing a unified framework for the automated refinement, scalable enforcement, and flexible adaptation of formal specifications in RL.Published
2026-03-14
How to Cite
Ambadkar, T. (2026). Specification-Guided Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41032–41033. https://doi.org/10.1609/aaai.v40i48.42142
Issue
Section
AAAI Doctoral Consortium Track