Belief-Driven Value Alignment for Human-Robot Collaboration
DOI:
https://doi.org/10.1609/aaai.v40i21.38809Abstract
As intelligent systems advance rapidly, human-robot collaboration is becoming increasingly important. Ensuring that the intelligent agent's behaviors match human intentions and value preferences is crucial for effective collaboration, which is termed the value alignment problem. Within the Reinforcement Learning (RL) paradigm, value alignment typically relies on pre-designed reward functions, and Cooperative Inverse Reinforcement Learning (CIRL) is often used to model value alignment as a human-robot game. However, existing works often assume that human is perfectly rational, and can fully obtain robot’s belief on human’s preference. To address this limitation, we propose a Particle Filter-based Hierarchical Dynamic Programming algorithm (PFHDP). By modeling the robot's belief state, this algorithm ensures the correct updates of human's estimate of the robot's belief. This allows human to adopt more targeted pedagogical behaviors to guide the robot based on her understanding of the robot's current belief, achieving belief alignment between human and robot and thereby promoting value alignment more effectively. Furthermore, we run experiments to evaluate the proposed method in two cooperative scenarios against some typical benchmark approaches. The experimental results show that our method can strengthen the alignment of belief states between human and robot, leading to enhanced value alignment.Published
2026-03-14
How to Cite
Li, S., Shi, B., Xia, Y., & Su, X. (2026). Belief-Driven Value Alignment for Human-Robot Collaboration. Proceedings of the AAAI Conference on Artificial Intelligence, 40(21), 17544–17552. https://doi.org/10.1609/aaai.v40i21.38809
Issue
Section
AAAI Technical Track on Humans and AI