Discovering User Attribute Stylistic Differences via Paraphrasing

Authors

  • Daniel Preotiuc-Pietro University of Pennsylvania
  • Wei Xu University of Pennsylvania
  • Lyle Ungar University of Pennsylvania

DOI:

https://doi.org/10.1609/aaai.v30i1.10393

Keywords:

User traits, Paraphrases, User profiling, User attributes, Stylistic Diferences, Natural Language Processing, Pscyholinguistics, Text mining

Abstract

User attribute prediction from social media text has proven successful and useful for downstream tasks. In previous studies, differences in user trait language use have been limited primarily to the presence or absence of words that indicate topical preferences. In this study, we aim to find linguistic style distinctions across three different user attributes: gender, age and occupational class. By combining paraphrases with a simple yet effective method, we capture a wide set of stylistic differences that are exempt from topic bias. We show their predictive power in user profiling, conformity with human perception and psycholinguistic hypotheses, and potential use in generating natural language tailored to specific user traits.

Downloads

Published

2016-03-05

How to Cite

Preotiuc-Pietro, D., Xu, W., & Ungar, L. (2016). Discovering User Attribute Stylistic Differences via Paraphrasing. Proceedings of the AAAI Conference on Artificial Intelligence, 30(1). https://doi.org/10.1609/aaai.v30i1.10393

Issue

Section

Technical Papers: NLP and Text Mining