Huang, Changxin, Yanbin Chang, Junfan Lin, Junyang Liang, Runhao Zeng, and Jianqiang Li. “Efficient Language-Instructed Skill Acquisition via Reward-Policy Co-Evolution”. Proceedings of the AAAI Conference on Artificial Intelligence 39, no. 14 (April 11, 2025): 14576–14584. Accessed May 7, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/33597.