Lin, H., Wu, H., Zhang, J., Sun, Y., Ye, J., & Yu, Y. (2024). Episodic Return Decomposition by Difference of Implicitly Assigned Sub-trajectory Reward. Proceedings of the AAAI Conference on Artificial Intelligence, 38(12), 13808–13816. https://doi.org/10.1609/aaai.v38i12.29287