Zhang, S., Zhao, J., Wang, P., Wang, T., Liang, Z., Tao, J., Huang, Y. and Feng, J. (2023) “Multi-Action Dialog Policy Learning from Logged User Feedback”, Proceedings of the AAAI Conference on Artificial Intelligence, 37(11), pp. 13976-13983. doi: 10.1609/aaai.v37i11.26636.