Reducing Goal State Divergence with Environment Design
DOI:
https://doi.org/10.1609/aaai.v40i21.38828Abstract
Generating behaviors that align with human expectations is a key requirement for human-robot collaboration. Potential behavior misalignment could lead to the robot performing actions with unanticipated, potentially dangerous side effects even while pursuing human goals. In this paper, we introduce a novel metric called Goal State Divergence (GSD) which quantifies the difference between the state a robot achieved in response to a human-specified goal and what the human expected. In cases where GSD cannot be directly calculated, we show how it can be approximated using maximal and minimal bounds. We then leverage GSD in our novel human-robot goal alignment design (HRGAD) problem, which identifies a minimal set of environment modifications that can reduce such mismatches. We show the effectiveness of our method in reducing the goal state divergence by empirically evaluating our approach on several planning benchmarks.Downloads
Published
2026-03-14
How to Cite
Sikes, K., Keren, S., & Sreedharan, S. (2026). Reducing Goal State Divergence with Environment Design. Proceedings of the AAAI Conference on Artificial Intelligence, 40(21), 17715–17723. https://doi.org/10.1609/aaai.v40i21.38828
Issue
Section
AAAI Technical Track on Humans and AI