1. Bai F, Zhang H, Tao T, Wu Z, Wang Y, Xu B. PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction. AAAI [Internet]. 2023 Jun 26 [cited 2024 Aug 9];37(6):6728-36. Available from: https://ojs.aaai.org/index.php/AAAI/article/view/25825