Integrating Pre-trained Model into Rule-based Dialogue Management

Authors

  • Jun Quan Soochow University Microsoft STCA
  • Meng Yang Microsoft STCA
  • Qiang Gan Microsoft STCA
  • Deyi Xiong Soochow University
  • Yiming Liu Microsoft STCA
  • Yuchen Dong Microsoft STCA
  • Fangxin Ouyang Microsoft STCA
  • Jun Tian Microsoft STCA
  • Ruiling Deng Microsoft STCA
  • Yongzhi Li Microsoft STCA
  • Yang Yang Microsoft STCA
  • Daxin Jiang Microsoft STCA

DOI:

https://doi.org/10.1609/aaai.v35i18.18023

Keywords:

Dialogue Management, Pre-training Method, Few-shot Capability, Scalability, Reduce Manual Efforts

Abstract

Rule-based dialogue management is still the most popular solution for industrial task-oriented dialogue systems for their interpretablility. However, it is hard for developers to maintain the dialogue logic when the scenarios get more and more complex. On the other hand, data-driven dialogue systems, usually with end-to-end structures, are popular in academic research and easier to deal with complex conversations, but such methods require plenty of training data and the behaviors are less interpretable. In this paper, we propose a method to leverages the strength of both rule-based and data-driven dialogue managers (DM). We firstly introduce the DM of Carina Dialog System (CDS, an advanced industrial dialogue system built by Microsoft). Then we propose the "model-trigger" design to make the DM trainable thus scalable to scenario changes. Furthermore, we integrate pre-trained models and empower the DM with few-shot capability. The experimental results demonstrate the effectiveness and strong few-shot capability of our method.

Downloads

Published

2021-05-18

How to Cite

Quan, J., Yang, M., Gan, Q., Xiong, D., Liu, Y., Dong, Y., Ouyang, F., Tian, J., Deng, R., Li, Y., Yang, Y., & Jiang, D. (2021). Integrating Pre-trained Model into Rule-based Dialogue Management. Proceedings of the AAAI Conference on Artificial Intelligence, 35(18), 16097-16099. https://doi.org/10.1609/aaai.v35i18.18023