(1)
Lin, H.; Wu, H.; Zhang, J.; Sun, Y.; Ye, J.; Yu, Y. Episodic Return Decomposition by Difference of Implicitly Assigned Sub-Trajectory Reward. AAAI 2024, 38, 13808-13816.