[1]

Lin, H. et al. 2024. Episodic Return Decomposition by Difference of Implicitly Assigned Sub-trajectory Reward. Proceedings of the AAAI Conference on Artificial Intelligence. 38, 12 (Mar. 2024), 13808–13816. DOI:https://doi.org/10.1609/aaai.v38i12.29287.