1.
Zou G, Li W, Wu H, Qian Y, Wang Y, Wang H. D²PPO: Diffusion Policy Policy Optimization with Dispersive Loss. AAAI [Internet]. 2026 Mar. 14 [cited 2026 May 27];40(22):18891-9. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/38959