1.
Huang C, Chang Y, Lin J, Liang J, Zeng R, Li J. Efficient Language-instructed Skill Acquisition via Reward-Policy Co-Evolution. AAAI [Internet]. 2025 Apr. 11 [cited 2026 May 7];39(14):14576-84. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/33597