Meng, W. (2023) “Off-Policy Proximal Policy Optimization”, Proceedings of the AAAI Conference on Artificial Intelligence, 37(8), pp. 9162–9170. doi: 10.1609/aaai.v37i8.26099.