Specification-Guided Reinforcement Learning

Tanmay Ambadkar

doi:10.1609/aaai.v40i48.42142

Specification-Guided Reinforcement Learning

Authors

Tanmay Ambadkar Pennsylvania State University

DOI:

https://doi.org/10.1609/aaai.v40i48.42142

Abstract

While Reinforcement Learning (RL) has demonstrated remarkable success in solving complex sequential decision-making problems, its application in real-world, safety-critical systems is hindered by its reliance on carefully engineered reward functions. Designing effective rewards is notoriously challenging and can lead to unintended or unsafe behaviors, a phenomenon known as reward hacking. Specification-guided RL has emerged as a principled alternative, leveraging formal methods to directly encode high-level objectives, safety requirements, and behavioral constraints. However, the practical utility of this approach is often limited by coarse or under-specified logical formulas and the computational challenge of enforcing safety at scale. This thesis addresses these limitations by developing a unified framework for the automated refinement, scalable enforcement, and flexible adaptation of formal specifications in RL.

AAAI-26 / IAAI-26 / EAAI-26 Proceedings Cover

Downloads

PDF
Poster

Published

2026-03-14

How to Cite

Ambadkar, T. (2026). Specification-Guided Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 40(48), 41032–41033. https://doi.org/10.1609/aaai.v40i48.42142

Download Citation

Issue

Vol. 40 No. 48: EAAI-26 AI for Education, Model AI Assignments, AAAI-26 Emerging Trends, Doctoral Consortium, Student Abstracts, Undergraduate Consortium and Demonstrations

Section

AAAI Doctoral Consortium Track

Specification-Guided Reinforcement Learning

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information