(1)
Bai, F.; Zhang, H.; Tao, T.; Wu, Z.; Wang, Y.; Xu, B. PiCor: Multi-Task Deep Reinforcement Learning With Policy Correction. AAAI 2023, 37, 6728-6736.