Constructing a Japanese Rap Lyric Generation Model with GRPO

Hayato Ogawa, Daisuke Kawahara


Abstract
Rap is a vocal style rooted in Hip-Hop culture, characterized by producing rhymes in synchrony with a rhythmic beat.This paper proposes a method for generating Japanese rap lyrics with a large language model (LLM) whose rhyming behavior is improved via reinforcement learning.We design a reward function that evaluates end rhymes between two generated bars and apply GRPO, a reinforcement-learning method, to encourage Japanese rhyming without using existing Japanese rap lyrics as training data.Experimental results show that, although output collapse is observed in some cases, GRPO increases the proportion of outputs that receive moderate or high human ratings on rhyme-related criteria.
Anthology ID:
2026.acl-srw.27
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Santosh T.Y.S.S., Juan Diego Rodriguez, Ona de Gibert
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
340–351
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-srw.27/
DOI:
Bibkey:
Cite (ACL):
Hayato Ogawa and Daisuke Kawahara. 2026. Constructing a Japanese Rap Lyric Generation Model with GRPO. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), pages 340–351, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Constructing a Japanese Rap Lyric Generation Model with GRPO (Ogawa & Kawahara, ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-srw.27.pdf