Belief-Driven Value Alignment for Human-Robot Collaboration

Authors

  • Saisai Li Wuhan University of Technology
  • Bing Shi Wuhan University of Technology
  • Yiming Xia Wuhan University of Technology
  • Xiao Su Wuhan University of Technology

DOI:

https://doi.org/10.1609/aaai.v40i21.38809

Abstract

As intelligent systems advance rapidly, human-robot collaboration is becoming increasingly important. Ensuring that the intelligent agent's behaviors match human intentions and value preferences is crucial for effective collaboration, which is termed the value alignment problem. Within the Reinforcement Learning (RL) paradigm, value alignment typically relies on pre-designed reward functions, and Cooperative Inverse Reinforcement Learning (CIRL) is often used to model value alignment as a human-robot game. However, existing works often assume that human is perfectly rational, and can fully obtain robot’s belief on human’s preference. To address this limitation, we propose a Particle Filter-based Hierarchical Dynamic Programming algorithm (PFHDP). By modeling the robot's belief state, this algorithm ensures the correct updates of human's estimate of the robot's belief. This allows human to adopt more targeted pedagogical behaviors to guide the robot based on her understanding of the robot's current belief, achieving belief alignment between human and robot and thereby promoting value alignment more effectively. Furthermore, we run experiments to evaluate the proposed method in two cooperative scenarios against some typical benchmark approaches. The experimental results show that our method can strengthen the alignment of belief states between human and robot, leading to enhanced value alignment.

Downloads

Published

2026-03-14

How to Cite

Li, S., Shi, B., Xia, Y., & Su, X. (2026). Belief-Driven Value Alignment for Human-Robot Collaboration. Proceedings of the AAAI Conference on Artificial Intelligence, 40(21), 17544–17552. https://doi.org/10.1609/aaai.v40i21.38809

Issue

Section

AAAI Technical Track on Humans and AI