A Machine Learning Approach to Twitter User Classification

Marco Pennacchiotti; Ana-Maria Popescu

doi:10.1609/icwsm.v5i1.14139

A Machine Learning Approach to Twitter User Classification

Authors

Marco Pennacchiotti Yahoo! Labs
Ana-Maria Popescu Yahoo! Labs

DOI:

https://doi.org/10.1609/icwsm.v5i1.14139

Abstract

This paper addresses the task of user classification in social media, with an application to Twitter. We automatically infer the values of user attributes such as political orientation or ethnicity by leveraging observable information such as the user behavior, network structure and the linguistic content of the user’s Twitter feed. We employ a machine learning approach which relies on a comprehensive set of features derived from such user information. We report encouraging experimental results on 3 tasks with different characteristics: political affiliation detection, ethnicity identification and detecting affinity for a particular business. Finally, our analysis shows that rich linguistic features prove consistently valuable across the 3 tasks and show great promise for additional user classification needs.

Downloads

Published

2021-08-03

How to Cite

Pennacchiotti, M., & Popescu, A.-M. (2021). A Machine Learning Approach to Twitter User Classification. Proceedings of the International AAAI Conference on Web and Social Media, 5(1), 281-288. https://doi.org/10.1609/icwsm.v5i1.14139

Download Citation

Issue

Vol. 5 No. 1 (2011): Fifth International AAAI Conference on Weblogs and Social Media

Section

Full Papers

A Machine Learning Approach to Twitter User Classification

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information