[1]
G. Zou, W. Li, H. Wu, Y. Qian, Y. Wang, and H. Wang, “D²PPO: Diffusion Policy Policy Optimization with Dispersive Loss”, AAAI, vol. 40, no. 22, pp. 18891–18899, Mar. 2026.