A Machine Learning Approach to Twitter User Classification

Authors

  • Marco Pennacchiotti Yahoo! Labs
  • Ana-Maria Popescu Yahoo! Labs

Abstract

This paper addresses the task of user classification in social media, with an application to Twitter. We automatically infer the values of user attributes such as political orientation or ethnicity by leveraging observable information such as the user behavior, network structure and the linguistic content of the user’s Twitter feed. We employ a machine learning approach which relies on a comprehensive set of features derived from such user information. We report encouraging experimental results on 3 tasks with different characteristics: political affiliation detection, ethnicity identification and detecting affinity for a particular business. Finally, our analysis shows that rich linguistic features prove consistently valuable across the 3 tasks and show great promise for additional user classification needs.

Downloads

Published

2021-08-03

How to Cite

Pennacchiotti, M., & Popescu, A.-M. (2021). A Machine Learning Approach to Twitter User Classification. Proceedings of the International AAAI Conference on Web and Social Media, 5(1), 281-288. Retrieved from https://ojs.aaai.org/index.php/ICWSM/article/view/14139