Talk Funny! A Large-Scale Humor Response Dataset with Chain-of-Humor Interpretation

Yuyan Chen; Yichen Yuan; Panjun Liu; Dayiheng Liu; Qinghao Guan; Mengfei Guo; Haiming Peng; Bang Liu; Zhixu Li; Yanghua Xiao

doi:10.1609/aaai.v38i16.29736

Authors

Yuyan Chen Shanghai Key Laboratory of Data Science, School of Computer Science, Fudan University
Yichen Yuan Institute of Automation, Chinese Academy of Sciences
Panjun Liu School of Computer Science, Beijing Institute of Technology
Dayiheng Liu Alibaba DAMO Academy
Qinghao Guan University of Zurich
Mengfei Guo Beijing Jiaotong University
Haiming Peng Shanghai Key Laboratory of Data Science, School of Computer Science, Fudan University
Bang Liu RALI \& Mila, Université de Montréal
Zhixu Li Shanghai Key Laboratory of Data Science, School of Computer Science, Fudan University
Yanghua Xiao Shanghai Key Laboratory of Data Science, School of Computer Science, Fudan University Fudan-Aishu Cognitive Intelligence Joint Research Center, Shanghai, China

DOI:

https://doi.org/10.1609/aaai.v38i16.29736

Keywords:

NLP: (Large) Language Models, NLP: Conversational AI/Dialog Systems, NLP: Generation, NLP: Interpretability, Analysis, and Evaluation of NLP Models

Abstract

Humor is a crucial part of human communication. Understanding humor and generating humorous responses in dialogue can provide natural and empathic human-computer interactions. However, most existing pre-trained language models (PLMs) perform unsatisfactorily in humor generation. On one hand, the serious shortage of humor corpus and datasets pose challenges for constructing models that can understand and generate humorous expressions. On the other hand, humor generation relies on rich knowledge and commonsense, which is often tacit and unspoken. In this paper, we construct the largest Chinese Explainable Humor Response Dataset to date with chain-of-humor and humor mind map annotations, which can be used to comprehensively evaluate as well as improve the humorous response ability of PLMs. We further design humor-related auxiliary tasks to further enhance PLMs' humorous response performance. Extensive evaluations demonstrate that our proposed dataset and auxiliary tasks effectively help PLMs to generate humorous responses, laying the groundwork for future humor research.

Talk Funny! A Large-Scale Humor Response Dataset with Chain-of-Humor Interpretation

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Subscription