A Disentangled-Attention Based Framework with Persona-Aware Prompt Learning for Dialogue Generation

Pingsheng Liu; Zhengjie Huang; Xiechi Zhang; Linlin Wang; Gerard de Melo; Xin Lin; Liang Pang; Liang He

doi:10.1609/aaai.v37i11.26556

Authors

Pingsheng Liu East China Normal University
Zhengjie Huang East China Normal University
Xiechi Zhang East China Normal University
Linlin Wang East China Normal University
Gerard de Melo Hasso Plattner Institute, University of Potsdam
Xin Lin East China Normal University
Liang Pang Institute of Computing Technology, Chinese Academy of Sciences
Liang He East China Normal University

DOI:

https://doi.org/10.1609/aaai.v37i11.26556

Keywords:

SNLP: Generation

Abstract

Endowing dialogue agents with personas is the key to delivering more human-like conversations. However, existing persona-grounded dialogue systems still lack informative details of human conversations and tend to reply with inconsistent and generic responses. One of the main underlying causes is that pre-defined persona sentences are generally short and merely superficial descriptions of personal attributes, making appropriate persona selection and understanding non-trivial. Another challenge is that it is crucial to consider the context and the conversation flow to dynamically determine when to invoke different types of persona signals. To address these problems, we propose a disentangled-attention based pre-training architecture, which incorporates persona-aware prompt learning to bridge the connection between the selected persona and response generation. Our model first exploits the conversation flow to select context-relevant personas, and subsequently enriches the superficial persona descriptions with extra personality traits through persona-aware prompting. Finally, the decoder leverages a disentangled-attention mechanism to flexibly control the reliance on personas and dialogue contexts, and incorporates A*-like keyword-based heuristic estimates for controllable generation. Extensive experiments show that our approach can outperform strong baselines and deliver more consistent and engaging responses on the PERSONA-CHAT dataset.

A Disentangled-Attention Based Framework with Persona-Aware Prompt Learning for Dialogue Generation

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Subscription