[1]
Z. Lipton, X. Li, J. Gao, L. Li, F. Ahmed, and L. Deng, “BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems”, AAAI, vol. 32, no. 1, Apr. 2018.