Zhang, S., Zhao, J., Wang, P., Wang, T., Liang, Z., Tao, J., Huang, Y., & Feng, J. (2023). Multi-Action Dialog Policy Learning from Logged User Feedback. Proceedings of the AAAI Conference on Artificial Intelligence, 37(11), 13976-13983. https://doi.org/10.1609/aaai.v37i11.26636