[1]
Zhang, S., Zhao, J., Wang, P., Wang, T., Liang, Z., Tao, J., Huang, Y. and Feng, J. 2023. Multi-Action Dialog Policy Learning from Logged User Feedback. Proceedings of the AAAI Conference on Artificial Intelligence. 37, 11 (Jun. 2023), 13976-13983. DOI:https://doi.org/10.1609/aaai.v37i11.26636.