Who Is Missing? Characterizing the Participation of Different Demographic Groups in a Korean Nationwide Daily Conversation Corpus

Authors

  • Haewoon Kwak, Singapore Management University
  • Jisun An, Singapore Management University
  • Kunwoo Park, Soongsil University

DOI:

https://doi.org/10.1609/icwsm.v16i1.19397

Keywords:

  • Text categorization; topic recognition; demographic/gender/age identification
  • Human computer interaction; social media tools; navigation and visualization
  • New social media applications; interfaces; interaction techniques
  • Psychological, personality-based and ethnographic studies of social media

Abstract

A conversation corpus is essential for building interactive AI applications. However, the demographics of the participants in such corpora remain largely underexplored, mainly because many corpora lack individual-level data. In this work, we analyze a nationwide Korean daily conversation corpus constructed by the National Institute of Korean Language (NIKL) to characterize how different demographic groups (by age and sex) participate in the corpus.

Published

2022-05-31

How to Cite

Kwak, H., An, J., & Park, K. (2022). Who Is Missing? Characterizing the Participation of Different Demographic Groups in a Korean Nationwide Daily Conversation Corpus. Proceedings of the International AAAI Conference on Web and Social Media, 16(1), 1409-1413. https://doi.org/10.1609/icwsm.v16i1.19397