(1) Zou, G.; Li, W.; Wu, H.; Qian, Y.; Wang, Y.; Wang, H. D²PPO: Diffusion Policy Policy Optimization With Dispersive Loss. AAAI 2026, 40, 18891-18899.