Belief-Driven Value Alignment for Human-Robot Collaboration

Saisai Li; Bing Shi; Yiming Xia; Xiao Su

doi:10.1609/aaai.v40i21.38809

Authors

Saisai Li Wuhan University of Technology
Bing Shi Wuhan University of Technology
Yiming Xia Wuhan University of Technology
Xiao Su Wuhan University of Technology

DOI:

https://doi.org/10.1609/aaai.v40i21.38809

Abstract

As intelligent systems advance rapidly, human-robot collaboration is becoming increasingly important. Ensuring that the intelligent agent's behaviors match human intentions and value preferences is crucial for effective collaboration, which is termed the value alignment problem. Within the Reinforcement Learning (RL) paradigm, value alignment typically relies on pre-designed reward functions, and Cooperative Inverse Reinforcement Learning (CIRL) is often used to model value alignment as a human-robot game. However, existing works often assume that human is perfectly rational, and can fully obtain robot’s belief on human’s preference. To address this limitation, we propose a Particle Filter-based Hierarchical Dynamic Programming algorithm (PFHDP). By modeling the robot's belief state, this algorithm ensures the correct updates of human's estimate of the robot's belief. This allows human to adopt more targeted pedagogical behaviors to guide the robot based on her understanding of the robot's current belief, achieving belief alignment between human and robot and thereby promoting value alignment more effectively. Furthermore, we run experiments to evaluate the proposed method in two cooperative scenarios against some typical benchmark approaches. The experimental results show that our method can strengthen the alignment of belief states between human and robot, leading to enhanced value alignment.

Belief-Driven Value Alignment for Human-Robot Collaboration

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information