Reducing Goal State Divergence with Environment Design

Authors

  • Kelsey Sikes Colorado State University
  • Sarah Keren Technion-Israel Institute of Technology
  • Sarath Sreedharan Colorado State University

DOI:

https://doi.org/10.1609/aaai.v40i21.38828

Abstract

Generating behaviors that align with human expectations is a key requirement for human-robot collaboration. Potential behavior misalignment could lead to the robot performing actions with unanticipated, potentially dangerous side effects even while pursuing human goals. In this paper, we introduce a novel metric called Goal State Divergence (GSD) which quantifies the difference between the state a robot achieved in response to a human-specified goal and what the human expected. In cases where GSD cannot be directly calculated, we show how it can be approximated using maximal and minimal bounds. We then leverage GSD in our novel human-robot goal alignment design (HRGAD) problem, which identifies a minimal set of environment modifications that can reduce such mismatches. We show the effectiveness of our method in reducing the goal state divergence by empirically evaluating our approach on several planning benchmarks.

Published

2026-03-14

How to Cite

Sikes, K., Keren, S., & Sreedharan, S. (2026). Reducing Goal State Divergence with Environment Design. Proceedings of the AAAI Conference on Artificial Intelligence, 40(21), 17715–17723. https://doi.org/10.1609/aaai.v40i21.38828

Issue

Section

AAAI Technical Track on Humans and AI