Meng, Wenjia, Qian Zheng, Gang Pan, and Yilong Yin. 2023. “Off-Policy Proximal Policy Optimization”. Proceedings of the AAAI Conference on Artificial Intelligence 37 (8):9162-70. https://doi.org/10.1609/aaai.v37i8.26099.