ChipSong: A Controllable Lyric Generation System for Chinese Popular Song

Nayu Liu, Wenjing Han, Guangcan Liu, Da Peng, Ran Zhang, Xiaorui Wang, Huabin Ruan


Abstract
In this work, we take a further step towards meeting the practical demands that musical short-video creators place on Chinese lyric generation: satisfying a song's format constraints, creating specific lyrics from open-ended inspiration inputs, and rhyming gracefully. A representative demand is word-level control of lyric format: for Chinese songs, creators even expect fixed-length words at certain positions in a lyric so that it matches a particular melody, an ability previous methods lack. Although the lyric generation community has recently made encouraging progress, most methods are not comprehensive enough to meet these demands simultaneously. We therefore propose ChipSong, an assisted lyric generation system built on a Transformer-based autoregressive language model that generates controlled lyric paragraphs suited to musical short-video display. It relies on three designs: 1) a novel Begin-Internal-End (BIE) word-granularity embedding sequence, with a guided attention mechanism, for word-level length format control, together with an explicit symbol set for sentence-level length format control; 2) an open-ended trigger word mechanism to guide the generation of specific lyric content; and 3) a paradigm of reverse-order training and shielding decoding for rhyme control. Extensive experiments show that ChipSong generates fluent lyrics while remaining highly consistent with the predetermined control conditions.
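To make the word-level format control concrete, the following minimal sketch expands a requested list of word lengths into one Begin-Internal-End tag per character. It is an illustration only, not the authors' implementation: the function name `bie_tags` is hypothetical, and tagging a single-character word as "B" is an assumption, since the abstract does not say how such words are handled.

```python
from typing import List


def bie_tags(word_lengths: List[int]) -> List[str]:
    """Expand per-word lengths into a per-character B/I/E tag sequence.

    Hypothetical helper for illustration; not taken from the ChipSong code.
    """
    tags: List[str] = []
    for n in word_lengths:
        if n < 1:
            raise ValueError("word length must be at least 1")
        if n == 1:
            # Assumption: a single-character word is tagged "B".
            tags.append("B")
        else:
            # First character begins the word, last one ends it,
            # and every character in between is internal.
            tags.extend(["B"] + ["I"] * (n - 2) + ["E"])
    return tags


# A six-character line requested as words of length 2, 3 and 1.
print(bie_tags([2, 3, 1]))
```

In a system like the one described, a tag sequence of this kind would presumably be embedded and supplied alongside the token embeddings, so that the model knows at each position whether a word should start, continue, or end.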
Anthology ID:
2022.in2writing-1.13
Volume:
Proceedings of the First Workshop on Intelligent and Interactive Writing Assistants (In2Writing 2022)
Month:
May
Year:
2022
Address:
Dublin, Ireland
Editors:
Ting-Hao 'Kenneth' Huang, Vipul Raheja, Dongyeop Kang, John Joon Young Chung, Daniel Gissin, Mina Lee, Katy Ilonka Gero
Venue:
In2Writing
Publisher:
Association for Computational Linguistics
Pages:
85–95
URL:
https://aclanthology.org/2022.in2writing-1.13
DOI:
10.18653/v1/2022.in2writing-1.13
Cite (ACL):
Nayu Liu, Wenjing Han, Guangcan Liu, Da Peng, Ran Zhang, Xiaorui Wang, and Huabin Ruan. 2022. ChipSong: A Controllable Lyric Generation System for Chinese Popular Song. In Proceedings of the First Workshop on Intelligent and Interactive Writing Assistants (In2Writing 2022), pages 85–95, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):
ChipSong: A Controllable Lyric Generation System for Chinese Popular Song (Liu et al., In2Writing 2022)
PDF:
https://preview.aclanthology.org/nschneid-patch-3/2022.in2writing-1.13.pdf
Video:
https://preview.aclanthology.org/nschneid-patch-3/2022.in2writing-1.13.mp4
Code:
korokes/chipsong