[1]
H. Lin, H. Wu, J. Zhang, Y. Sun, J. Ye, and Y. Yu, “Episodic Return Decomposition by Difference of Implicitly Assigned Sub-trajectory Reward”, AAAI, vol. 38, no. 12, pp. 13808–13816, Mar. 2024.