Language Matters In Twitter: A Large Scale Study

Authors

  • Lichan Hong Palo Alto Research Center (PARC)
  • Gregorio Convertino Palo Alto Research Center (PARC)
  • Ed Chi Google

DOI:

https://doi.org/10.1609/icwsm.v5i1.14184

Abstract

Despite the widespread adoption of Twitter internationally, little research has investigated the differences among users of different languages. In prior research, the natural tendency has been to assume that the behaviors of English users generalize to other language users. We studied 62 million tweets collected over a four-week period and found that more than 100 languages were used. Only half of the tweets were in English (51%). Other popular languages including Japanese, Portuguese, Indonesian, and Spanish together accounted for 39% of the tweets. Examining users of the top 10 languages, we discovered cross-language differences in adoption of features such as URLs, hashtags, mentions, replies, and retweets. We discuss our work’s implications for research on large-scale social systems and design of cross-cultural communication tools.

Downloads

Published

2021-08-03

How to Cite

Hong, L., Convertino, G., & Chi, E. (2021). Language Matters In Twitter: A Large Scale Study. Proceedings of the International AAAI Conference on Web and Social Media, 5(1), 518-521. https://doi.org/10.1609/icwsm.v5i1.14184