[1]
Z. Li, J. Kiseleva, and M. de Rijke, “Dialogue Generation: From Imitation Learning to Inverse Reinforcement Learning,” AAAI, vol. 33, no. 1, pp. 6722–6729, Jul. 2019.