Meng, W., Q. Zheng, G. Pan, and Y. Yin. “Off-Policy Proximal Policy Optimization”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 37, no. 8, June 2023, pp. 9162-70, doi:10.1609/aaai.v37i8.26099.