(1) Chen, C.; Tang, H.; Hao, J.; Liu, W.; Meng, Z. Addressing Action Oscillations through Learning Policy Inertia. AAAI 2021, 35, 7020-7027.