A Data-Driven Approach to Question Subjectivity Identification in Community Question Answering

Authors

  • Tom Chao Zhou The Chinese University of Hong Kong
  • Xiance Si Google
  • Edward Y. Chang Google
  • Irwin King ATT
  • Michael R. Lyu The Chinese University of Hong Kong

DOI:

https://doi.org/10.1609/aaai.v26i1.8111

Keywords:

Community Question Answering, Question Classification, Data Driven Approach, Social Signal

Abstract

Automatic Subjective Question Answering (ASQA), which aims to answer users' subjective questions using summaries of multiple opinions, is becoming increasingly important. One challenge of ASQA is that the expected answers to subjective questions may not readily exist on the Web. The rise and popularity of Community Question Answering (CQA) sites, which provide platforms for people to post and answer questions, provide an alternative to ASQA. One important task of ASQA is question subjectivity identification, which identifies whether a user is asking a subjective question. Unfortunately, little labeled training data has been available for this task. In this paper, we propose an approach that collects training data automatically by utilizing social signals in CQA sites, without any manual labeling. Experimental results show that our data-driven approach achieves a 9.37% relative improvement over the supervised approach using manually labeled data, and a 5.15% relative gain over a state-of-the-art semi-supervised approach. In addition, we propose several heuristic features for question subjectivity identification. By adding these features, we achieve an 11.23% relative improvement over word n-gram features under the same experimental setting.

Published

2021-09-20

How to Cite

Zhou, T. C., Si, X., Chang, E. Y., King, I., & Lyu, M. R. (2021). A Data-Driven Approach to Question Subjectivity Identification in Community Question Answering. Proceedings of the AAAI Conference on Artificial Intelligence, 26(1), 164-170. https://doi.org/10.1609/aaai.v26i1.8111