Zha, Yantian, Lin Guan, and Subbarao Kambhampati. “Learning from Ambiguous Demonstrations with Self-Explanation Guided Reinforcement Learning.” Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 9 (March 24, 2024): 10395–10403. Accessed May 25, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/28907.