Toward Neurosymbolic Reinforcement Learning via Editable Specifications

Vedant Khandelwal; Hong Yung Yip; Amit Sheth

doi:10.1609/aaaiss.v8i1.42594

Authors

Vedant Khandelwal AI Institute, University of South Carolina
Hong Yung Yip AI Institute, University of South Carolina
Amit Sheth AI Institute, University of South Carolina Indian AI Research Organisation (IAIRO), India

DOI:

https://doi.org/10.1609/aaaiss.v8i1.42594

Abstract

Reinforcement learning systems are commonly adapted to new settings by retraining or fine-tuning policies. This default is costly, difficult to audit, and poorly aligned with structured requirement changes such as revised safety rules, new operational constraints, or updated user preferences. We argue for an alternative abstraction: adaptation via edits to an external, human-readable specification that the agent consults at execution time. We propose conditioning decision-making on an editable knowledge graph encoding (i) applicability rules capturing action preconditions and high-level effects, (ii) hard constraints defining feasibility, and (iii) soft preferences shaping tradeoffs among feasible behaviors. Requirement changes become graph edits, not policy rewrites, operationalized through constraint-based action shielding and preference-driven objective shaping. This enables immediate, auditable behavior updates with zero or minimal gradient-based adaptation. We outline edit-based evaluation protocol and highlight open problems in specification design, edit inference, and guarantees.

Toward Neurosymbolic Reinforcement Learning via Editable Specifications

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information