Emosical: An Emotion-Annotated Musical Theatre Dataset

Hayoon Kim, Ahyeon Choi, Sungho Lee, Hyun Jin Jung, Kyogu Lee


Abstract
This paper presents Emosical, a multimodal open-source dataset of musical films. Emosical comprises video, vocal audio, text, and character identity paired samples with annotated emotion tags. Emosical provides rich emotion annotations for each sample by inferring the background story of the characters. To achieve this, we leverage the musical theatre script, which contains the characters’ complete background stories and narrative contexts. The annotation pipeline includes feeding the speaking character, text, global persona, and context of the dialogue and song track into a large language model. To verify the effectiveness of our tagging scheme, we perform an ablation study by bypassing each step of the pipeline. The ablation results show the usefulness of each component in generating accurate emotion tags. A subjective test is conducted to compare the generated tags of each ablation result. We also perform a statistical analysis to find out the global characteristics of the collected emotion tags. Emosical would enable expressive synthesis and tagging of the speech and singing voice in the musical theatre domain in future research. Emosical is publicly available at https://github.com/gillosae/emosical.
Anthology ID:
2024.findings-emnlp.241
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2024
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
4169–4180
Language:
URL:
https://preview.aclanthology.org/add-emnlp-2024-awards/2024.findings-emnlp.241/
DOI:
10.18653/v1/2024.findings-emnlp.241
Bibkey:
Cite (ACL):
Hayoon Kim, Ahyeon Choi, Sungho Lee, Hyun Jin Jung, and Kyogu Lee. 2024. Emosical: An Emotion-Annotated Musical Theatre Dataset. In Findings of the Association for Computational Linguistics: EMNLP 2024, pages 4169–4180, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
Emosical: An Emotion-Annotated Musical Theatre Dataset (Kim et al., Findings 2024)
Copy Citation:
PDF:
https://preview.aclanthology.org/add-emnlp-2024-awards/2024.findings-emnlp.241.pdf