Li, L.-F., Zhao, P., & Zhou, Z.-H. (2024). Dynamic Regret of Adversarial MDPs with Unknown Transition and Linear Function Approximation. Proceedings of the AAAI Conference on Artificial Intelligence, 38(12), 13572-13580. https://doi.org/10.1609/aaai.v38i12.29261