Language Use Matters: Analysis of the Linguistic Structure of Question Texts Can Characterize Answerability in Quora

Authors

  • Suman Maity Indian Institute of Technology Kharagpur
  • Aman Kharb Indian Institute of Technology Kharagpur
  • Animesh Mukherjee Indian Institute of Technology Kharagpur

Abstract

Quora is one of the most popular community Q and A sites of recent times. However, many question posts on this Q and A site often do not get answered. In this paper, we quantify various linguistic activities that discriminates an answered question from an unanswered one. Our central finding is that the way users use language while writing the question text can be a very effective means to characterize answerability. This characterization helps us to predict early if a question remaining unanswered for a specific time period t will eventually be answered or not and achieve an accuracy of 76.26% (t=1 month) and 68.33% (t=3 months). Notably, features representing the language use patterns of the users are most discriminative and alone account for an accuracy of 74.18%. We also compare our method with some of the similar works (Dror et al., Yang et al.) achieving a maximum improvement of ~39% in terms of accuracy.

Downloads

Published

2017-05-03

How to Cite

Maity, S., Kharb, A., & Mukherjee, A. (2017). Language Use Matters: Analysis of the Linguistic Structure of Question Texts Can Characterize Answerability in Quora. Proceedings of the International AAAI Conference on Web and Social Media, 11(1), 612-615. Retrieved from https://ojs.aaai.org/index.php/ICWSM/article/view/14947