Hwang, T., & Oh, M.- hwan. (2023). Model-Based Reinforcement Learning with Multinomial Logistic Function Approximation. Proceedings of the AAAI Conference on Artificial Intelligence, 37(7), 7971-7979. https://doi.org/10.1609/aaai.v37i7.25964