[1] Li, L.-F., Zhao, P. and Zhou, Z.-H. 2024. Dynamic Regret of Adversarial MDPs with Unknown Transition and Linear Function Approximation. Proceedings of the AAAI Conference on Artificial Intelligence. 38, 12 (Mar. 2024), 13572-13580. DOI:https://doi.org/10.1609/aaai.v38i12.29261.