Abstract
Language generation tasks that seek to mimic human ability to use language creatively are difficult to evaluate, since one must consider creativity, style, and other non-trivial aspects of the generated text. The goal of this paper is to develop evaluations methods for one such task, ghostwriting of rap lyrics, and to provide an explicit, quantifiable foundation for the goals and future directions for this task. Ghostwriting must produce text that is similar in style to the emulated artist, yet distinct in content. We develop a novel evaluation methodology that addresses several complementary aspects of this task, and illustrate how such evaluation can be used to meaning fully analyze system performance. We provide a corpus of lyrics for 13 rap artists, annotated for stylistic similarity, which allows us to assess the feasibility of manual evaluation for generated verse.- Anthology ID:
- W18-1604
- Volume:
- Proceedings of the Second Workshop on Stylistic Variation
- Month:
- June
- Year:
- 2018
- Address:
- New Orleans
- Venue:
- Style-Var
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 29–38
- Language:
- URL:
- https://aclanthology.org/W18-1604
- DOI:
- 10.18653/v1/W18-1604
- Cite (ACL):
- Peter Potash, Alexey Romanov, and Anna Rumshisky. 2018. Evaluating Creative Language Generation: The Case of Rap Lyric Ghostwriting. In Proceedings of the Second Workshop on Stylistic Variation, pages 29–38, New Orleans. Association for Computational Linguistics.
- Cite (Informal):
- Evaluating Creative Language Generation: The Case of Rap Lyric Ghostwriting (Potash et al., Style-Var 2018)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/W18-1604.pdf