Comparing Automated Methods to Detect Explicit Content in Song Lyrics
Abstract
The Parental Advisory Label (PAL) is a warning label that is placed on audio recordings in recognition of profanity or inappropriate references, with the intention of alerting parents of material potentially unsuitable for children. Since 2015, digital providers – such as iTunes, Spotify, Amazon Music and Deezer – also follow PAL guidelines and tag such tracks as “explicit”. Nowadays, such labelling is carried out mainly manually on voluntary basis, with the drawbacks of being time consuming and therefore costly, error prone and partly a subjective task. In this paper, we compare automated methods ranging from dictionary-based lookup to state-of-the-art deep neural networks to automatically detect explicit contents in English lyrics. We show that more complex models perform only slightly better on this task, and relying on a qualitative analysis of the data, we discuss the inherent hardness and subjectivity of the task.- Anthology ID:
- R19-1039
- Volume:
- Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019)
- Month:
- September
- Year:
- 2019
- Address:
- Varna, Bulgaria
- Editors:
- Ruslan Mitkov, Galia Angelova
- Venue:
- RANLP
- SIG:
- Publisher:
- INCOMA Ltd.
- Note:
- Pages:
- 338–344
- Language:
- URL:
- https://aclanthology.org/R19-1039
- DOI:
- 10.26615/978-954-452-056-4_039
- Cite (ACL):
- Michael Fell, Elena Cabrio, Michele Corazza, and Fabien Gandon. 2019. Comparing Automated Methods to Detect Explicit Content in Song Lyrics. In Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019), pages 338–344, Varna, Bulgaria. INCOMA Ltd..
- Cite (Informal):
- Comparing Automated Methods to Detect Explicit Content in Song Lyrics (Fell et al., RANLP 2019)
- PDF:
- https://preview.aclanthology.org/teach-a-man-to-fish/R19-1039.pdf