Dialogues Are Not Just Text: Modeling Cognition for Dialogue Coherence Evaluation

Authors

  • Xue Li Harbin Institute of Technology
  • Jia Su Huawei Cloud
  • Yang Yang Harbin Institute of Technology
  • Zipeng Gao University of Science and Technology of China
  • Xinyu Duan Huawei Cloud
  • Yi Guan Harbin Institute of Technology

DOI:

https://doi.org/10.1609/aaai.v38i17.29819

Keywords:

NLP: Conversational AI/Dialog Systems, CMS: Other Foundations of Cognitive Modeling & Systems, NLP: Interpretability, Analysis, and Evaluation of NLP Models

Abstract

The generation of logically coherent dialogues by humans relies on underlying cognitive abilities. Based on this, we redefine the dialogue coherence evaluation process, combining cognitive judgment with the basic text to achieve a more human-like evaluation. We propose a novel dialogue evaluation framework based on Dialogue Cognition Graph (DCGEval) to implement the fusion by in-depth interaction between cognition modeling and text modeling. The proposed Abstract Meaning Representation (AMR) based graph structure called DCG aims to uniformly model four dialogue cognitive abilities. Specifically, core-semantic cognition is modeled by converting the utterance into an AMR graph, which can extract essential semantic information without redundancy. The temporal and role cognition are modeled by establishing logical relationships among the different AMR graphs. Finally, the commonsense knowledge from ConceptNet is fused to express commonsense cognition. Experiments demonstrate the necessity of modeling human cognition for dialogue evaluation, and our DCGEval presents stronger correlations with human judgments compared to other state-of-the-art evaluation metrics.

Published

2024-03-24

How to Cite

Li, X., Su, J., Yang, Y., Gao, Z., Duan, X., & Guan, Y. (2024). Dialogues Are Not Just Text: Modeling Cognition for Dialogue Coherence Evaluation. Proceedings of the AAAI Conference on Artificial Intelligence, 38(17), 18573-18581. https://doi.org/10.1609/aaai.v38i17.29819

Issue

Section

AAAI Technical Track on Natural Language Processing II