Li, L.-F., P. Zhao, and Z.-H. Zhou. “Dynamic Regret of Adversarial MDPs With Unknown Transition and Linear Function Approximation”. Proceedings of the AAAI Conference on Artificial Intelligence, vol. 38, no. 12, Mar. 2024, pp. 13572-80, doi:10.1609/aaai.v38i12.29261.