Characterizing Geographic Variation in Well-Being Using Tweets

Hansen Schwartz; Johannes Eichstaedt; Margaret Kern; Lukasz Dziurzynski; Richard Lucas; Megha Agrawal; Gregory Park; Shrinidhi Lakshmikanth; Sneha Jha; Martin Seligman; Lyle Ungar

doi:10.1609/icwsm.v7i1.14442

Authors

Hansen Schwartz University of Pennsylvania
Johannes Eichstaedt University of Pennsylvania
Margaret Kern University of Pennsylvania
Lukasz Dziurzynski University of Pennsylvania
Richard Lucas Michigan State University
Megha Agrawal University of Pennsylvania
Gregory Park University of Pennsylvania
Shrinidhi Lakshmikanth University of Pennsylvania
Sneha Jha University of Pennsylvania
Martin Seligman University of Pennsylvania
Lyle Ungar University of Pennsylvania

DOI:

https://doi.org/10.1609/icwsm.v7i1.14442

Keywords:

well-being, social media, natural language processing, Twitter

Abstract

The language used in tweets from 1,300 different US counties was found to be predictive of the subjective well-being of people living in those counties as measured by representative surveys. Topics, sets of co-occurring words derived from the tweets using LDA, improved accuracy in predicting life satisfaction over and above standard demographic and socio-economic controls (age, gender, ethnicity, income, and education). The LDA topics provide a greater behavioural and conceptual resolution into life satisfaction than the broad socio-economic and demographic variables. For example, tied in with the psychological literature, words relating to outdoor activities, spiritual meaning, exercise, and good jobs correlate with increased life satisfaction, while words signifying disengagement like ’bored’ and ’tired’ show a negative association.

Characterizing Geographic Variation in Well-Being Using Tweets

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information