(1)
Meng, W.; Zheng, Q.; Pan, G.; Yin, Y. Off-Policy Proximal Policy Optimization. AAAI 2023, 37, 9162-9170.