Toward Neurosymbolic Reinforcement Learning via Editable Specifications
DOI:
https://doi.org/10.1609/aaaiss.v8i1.42594Abstract
Reinforcement learning systems are commonly adapted to new settings by retraining or fine-tuning policies. This default is costly, difficult to audit, and poorly aligned with structured requirement changes such as revised safety rules, new operational constraints, or updated user preferences. We argue for an alternative abstraction: adaptation via edits to an external, human-readable specification that the agent consults at execution time. We propose conditioning decision-making on an editable knowledge graph encoding (i) applicability rules capturing action preconditions and high-level effects, (ii) hard constraints defining feasibility, and (iii) soft preferences shaping tradeoffs among feasible behaviors. Requirement changes become graph edits, not policy rewrites, operationalized through constraint-based action shielding and preference-driven objective shaping. This enables immediate, auditable behavior updates with zero or minimal gradient-based adaptation. We outline edit-based evaluation protocol and highlight open problems in specification design, edit inference, and guarantees.Downloads
Published
2026-05-18
How to Cite
Khandelwal, V., Yip, H. Y., & Sheth, A. (2026). Toward Neurosymbolic Reinforcement Learning via Editable Specifications. Proceedings of the AAAI Symposium Series, 8(1), 603–607. https://doi.org/10.1609/aaaiss.v8i1.42594
Issue
Section
Machine Learning and Knowledge Engineering (MAKE 2026) (Position papers)