Toward Neurosymbolic Reinforcement Learning via Editable Specifications

Authors

  • Vedant Khandelwal AI Institute, University of South Carolina
  • Hong Yung Yip AI Institute, University of South Carolina
  • Amit Sheth AI Institute, University of South Carolina Indian AI Research Organisation (IAIRO), India

DOI:

https://doi.org/10.1609/aaaiss.v8i1.42594

Abstract

Reinforcement learning systems are commonly adapted to new settings by retraining or fine-tuning policies. This default is costly, difficult to audit, and poorly aligned with structured requirement changes such as revised safety rules, new operational constraints, or updated user preferences. We argue for an alternative abstraction: adaptation via edits to an external, human-readable specification that the agent consults at execution time. We propose conditioning decision-making on an editable knowledge graph encoding (i) applicability rules capturing action preconditions and high-level effects, (ii) hard constraints defining feasibility, and (iii) soft preferences shaping tradeoffs among feasible behaviors. Requirement changes become graph edits, not policy rewrites, operationalized through constraint-based action shielding and preference-driven objective shaping. This enables immediate, auditable behavior updates with zero or minimal gradient-based adaptation. We outline edit-based evaluation protocol and highlight open problems in specification design, edit inference, and guarantees.

Downloads

Published

2026-05-18

How to Cite

Khandelwal, V., Yip, H. Y., & Sheth, A. (2026). Toward Neurosymbolic Reinforcement Learning via Editable Specifications. Proceedings of the AAAI Symposium Series, 8(1), 603–607. https://doi.org/10.1609/aaaiss.v8i1.42594

Issue

Section

Machine Learning and Knowledge Engineering (MAKE 2026) (Position papers)