(1) Liu, B.; He, J.; Shi, H.; Wang, E.; Han, W.; Hao, J.; Wang, P.; Zhang, Z. CHDP: Cooperative Hybrid Diffusion Policies for Reinforcement Learning in Parameterized Action Space. AAAI 2026, 40, 23640-23648.