1. Lipton Z, Li X, Gao J, Li L, Ahmed F, Deng L. BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems. AAAI [Internet]. 2018 Apr 27 [cited 2024 Mar 29];32(1). Available from: https://ojs.aaai.org/index.php/AAAI/article/view/11946