The Contribution of Lyrics and Acoustics to Collaborative Understanding of Mood

Shahrzad Naseri; Sravana Reddy; Joana Correia; Jussi Karlgren; Rosie Jones

doi:10.1609/icwsm.v16i1.19326

Authors

Shahrzad Naseri University of Massachusetts Amherst
Sravana Reddy ASAPP
Joana Correia Spotify
Jussi Karlgren Spotify
Rosie Jones Spotify

DOI:

https://doi.org/10.1609/icwsm.v16i1.19326

Keywords:

Text categorization; topic recognition; demographic/gender/age identification, Qualitative and quantitative studies of social media, Subjectivity in textual data; sentiment analysis; polarity/opinion identification and extraction, Linguistic analyses of social media behavior

Abstract

In this work, we study the association between song lyrics and mood through a data-driven analysis. Our data set consists of nearly one million songs, with song-mood associations derived from user playlists on the Spotify streaming platform. We use state-of-the-art transformer-based natural language processing models to learn the association between lyrics and moods. We find that a pretrained transformer-based language model in a zero-shot setting (i.e., out of the box, with no further training on our data) is powerful for capturing song-mood associations. Moreover, we show that training on song-mood associations yields a highly accurate model that predicts these associations for unseen songs. Furthermore, by comparing the predictions of a model using lyrics with those of one using acoustic features, we observe that the relative importance of lyrics versus acoustics for mood prediction depends on the specific mood. Finally, we verify whether the models capture the same information about lyrics and acoustics as humans do, through an annotation task in which we obtain human judgments of mood-song relevance based on lyrics and acoustics.
