Kim, Woosung, Donghyeon Ki, and Byung-Jun Lee. “Relaxed Stationary Distribution Correction Estimation for Improved Offline Policy Optimization.” Proceedings of the AAAI Conference on Artificial Intelligence 38, no. 12 (March 24, 2024): 13185–13192. Accessed November 2, 2024. https://ojs.aaai.org/index.php/AAAI/article/view/29218.