Talk Funny! A Large-Scale Humor Response Dataset with Chain-of-Humor Interpretation

Authors

  • Yuyan Chen Shanghai Key Laboratory of Data Science, School of Computer Science, Fudan University
  • Yichen Yuan Institute of Automation, Chinese Academy of Sciences
  • Panjun Liu School of Computer Science, Beijing Institute of Technology
  • Dayiheng Liu Alibaba DAMO Academy
  • Qinghao Guan University of Zurich
  • Mengfei Guo Beijing Jiaotong University
  • Haiming Peng Shanghai Key Laboratory of Data Science, School of Computer Science, Fudan University
  • Bang Liu RALI \& Mila, Université de Montréal
  • Zhixu Li Shanghai Key Laboratory of Data Science, School of Computer Science, Fudan University
  • Yanghua Xiao Shanghai Key Laboratory of Data Science, School of Computer Science, Fudan University Fudan-Aishu Cognitive Intelligence Joint Research Center, Shanghai, China

DOI:

https://doi.org/10.1609/aaai.v38i16.29736

Keywords:

NLP: (Large) Language Models, NLP: Conversational AI/Dialog Systems, NLP: Generation, NLP: Interpretability, Analysis, and Evaluation of NLP Models

Abstract

Humor is a crucial part of human communication. Understanding humor and generating humorous responses in dialogue can provide natural and empathic human-computer interactions. However, most existing pre-trained language models (PLMs) perform unsatisfactorily in humor generation. On one hand, the serious shortage of humor corpus and datasets pose challenges for constructing models that can understand and generate humorous expressions. On the other hand, humor generation relies on rich knowledge and commonsense, which is often tacit and unspoken. In this paper, we construct the largest Chinese Explainable Humor Response Dataset to date with chain-of-humor and humor mind map annotations, which can be used to comprehensively evaluate as well as improve the humorous response ability of PLMs. We further design humor-related auxiliary tasks to further enhance PLMs' humorous response performance. Extensive evaluations demonstrate that our proposed dataset and auxiliary tasks effectively help PLMs to generate humorous responses, laying the groundwork for future humor research.

Published

2024-03-24

How to Cite

Chen, Y., Yuan, Y., Liu, P., Liu, D., Guan, Q., Guo, M., Peng, H., Liu, B., Li, Z., & Xiao, Y. (2024). Talk Funny! A Large-Scale Humor Response Dataset with Chain-of-Humor Interpretation. Proceedings of the AAAI Conference on Artificial Intelligence, 38(16), 17826-17834. https://doi.org/10.1609/aaai.v38i16.29736

Issue

Section

AAAI Technical Track on Natural Language Processing I