VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text

C. Hutto; Eric Gilbert

doi:10.1609/icwsm.v8i1.14550

Authors

C. Hutto Georgia Institute of Technology
Eric Gilbert Georgia Institute of Technology

DOI:

https://doi.org/10.1609/icwsm.v8i1.14550

Keywords:

Sentiment Analysis, Social Media, Validated Sentiment Lexicon, Human Centered Computing

Abstract

The inherent nature of social media content poses serious challenges to practical applications of sentiment analysis. We present VADER, a simple rule-based model for general sentiment analysis, and compare its effectiveness to eleven typical state-of-practice benchmarks including LIWC, ANEW, the General Inquirer, SentiWordNet, and machine learning oriented techniques relying on Naive Bayes, Maximum Entropy, and Support Vector Machine (SVM) algorithms. Using a combination of qualitative and quantitative methods, we first construct and empirically validate a gold-standard list of lexical features (along with their associated sentiment intensity measures) which are specifically attuned to sentiment in microblog-like contexts. We then combine these lexical features with consideration for five general rules that embody grammatical and syntactical conventions for expressing and emphasizing sentiment intensity. Interestingly, using our parsimonious rule-based model to assess the sentiment of tweets, we find that VADER outperforms individual human raters (F1 Classification Accuracy = 0.96 and 0.84, respectively), and generalizes more favorably across contexts than any of our benchmarks.

VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information