[1]
L.-F. Li, P. Zhao, and Z.-H. Zhou, “Dynamic Regret of Adversarial MDPs with Unknown Transition and Linear Function Approximation”, AAAI, vol. 38, no. 12, pp. 13572-13580, Mar. 2024.