1. Yang J, Zhu B, Chen J, Jiang Y-G. Actor-Critic for Continuous Action Chunks: A Reinforcement Learning Framework for Long-Horizon Robotic Manipulation with Sparse Reward. AAAI [Internet]. 2026 Mar. 14 [cited 2026 May 27];40(22):18692-700. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/38937