(1)
Li, L.-F.; Zhao, P.; Zhou, Z.-H. Dynamic Regret of Adversarial MDPs With Unknown Transition and Linear Function Approximation. AAAI 2024, 38, 13572-13580.