(1)

Hwang, T.; Oh, M.-h. Model-Based Reinforcement Learning With Multinomial Logistic Function Approximation. AAAI 2023, 37, 7971-7979.