Zou, G., Li, W., Wu, H., Qian, Y., Wang, Y., & Wang, H. (2026). D²PPO: Diffusion Policy Policy Optimization with Dispersive Loss. Proceedings of the AAAI Conference on Artificial Intelligence, 40(22), 18891–18899. https://doi.org/10.1609/aaai.v40i22.38959