[1]
H. Zou, T. Ren, D. Yan, H. Su, and J. Zhu, “Learning Task-Distribution Reward Shaping with Meta-Learning”, AAAI, vol. 35, no. 12, pp. 11210-11218, May 2021.