Bai, Fengshuo, Hongming Zhang, Tianyang Tao, Zhiheng Wu, Yanna Wang, and Bo Xu. “PiCor: Multi-Task Deep Reinforcement Learning With Policy Correction”. Proceedings of the AAAI Conference on Artificial Intelligence 37, no. 6 (June 26, 2023): 6728-6736. Accessed April 25, 2026. https://ojs.aaai.org/index.php/AAAI/article/view/25825.