Audio-Lyrics Alignment Dataset for Italian Arias

Pushkar Jajoria, Arianna Graciotti, Giovanna Casali, Jesujoba Alabi, Rodolfo Delmonte, Angelo Pompilio, Rocco Tripodi, James McDermott, Dietrich Klakow


Abstract
Aligning song lyrics with sung audio is challenging, especially for languages and music styles where annotated datasets are scarce. We address this gap by presenting the first dataset of Italian opera arias annotated with lyrics and time-stamps per word. The dataset comprises of 24 arias drawn from well-known operas of the 18th to 20th centuries with a total audio duration of nearly two hours. We benchmark both music alignment models and speech forced alignment models and show that existing methods face significant challenges on this dataset, with performance dropping by 45% compared to other datasets. Multilingual and speech-based models exhibit relatively better performance on this dataset. We also evaluate few-shot fine-tuning of these models on the new dataset and find that, while it yields only marginal overall improvement, it produces localized gains on specific arias, suggesting that limited exposure helps the model adapt to some patterns but cannot fully overcome differences in language or musical style.
Anthology ID:
2026.lrec-main.454
Volume:
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Month:
May
Year:
2026
Address:
Palma de Mallorca, Spain
Editors:
Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
Venue:
LREC
SIG:
Publisher:
ELRA Language Resource Association
Note:
Pages:
5757–5766
Language:
URL:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.454/
DOI:
Bibkey:
Cite (ACL):
Pushkar Jajoria, Arianna Graciotti, Giovanna Casali, Jesujoba Alabi, Rodolfo Delmonte, Angelo Pompilio, Rocco Tripodi, James McDermott, and Dietrich Klakow. 2026. Audio-Lyrics Alignment Dataset for Italian Arias. International Conference on Language Resources and Evaluation, main:5757–5766.
Cite (Informal):
Audio-Lyrics Alignment Dataset for Italian Arias (Jajoria et al., LREC 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-lrec/2026.lrec-main.454.pdf