A Parallel Corpus of Music and Lyrics Annotated with Emotions

Carlo Strapparava, Rada Mihalcea, Alberto Battocchi


Abstract
In this paper, we introduce a novel parallel corpus of music and lyrics, annotated with emotions at line level. We first describe the corpus, consisting of 100 popular songs, each of them including a music component, provided in the MIDI format, as well as a lyrics component, made available as raw text. We then describe our work on enhancing this corpus with emotion annotations using crowdsourcing. We also present some initial experiments on emotion classification using the music and the lyrics representations of the songs, which lead to encouraging results, thus demonstrating the promise of using joint music-lyric models for song processing.
Anthology ID:
L12-1425
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
2343–2346
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/730_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Carlo Strapparava, Rada Mihalcea, and Alberto Battocchi. 2012. A Parallel Corpus of Music and Lyrics Annotated with Emotions. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 2343–2346, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
A Parallel Corpus of Music and Lyrics Annotated with Emotions (Strapparava et al., LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/730_Paper.pdf