Gabriel Meseguer-Brocal
Also published as: Gabriel Meseguer Brocal
2025
Synthetic Lyrics Detection Across Languages and Genres
Yanis Labrak
|
Markus Frohmann
|
Gabriel Meseguer-Brocal
|
Elena V. Epure
Proceedings of the 5th Workshop on Trustworthy NLP (TrustNLP 2025)
In recent years, the use of large language models (LLMs) to generate music content, particularly lyrics, has gained in popularity. These advances provide valuable tools for artists and enhance their creative processes, but they also raise concerns about copyright violations, consumer satisfaction, and content spamming. Previous research has explored content detection in various domains. However, no work has focused on the text modality, lyrics, in music. To address this gap, we curated a diverse dataset of real and synthetic lyrics from multiple languages, music genres, and artists. The generation pipeline was validated using both humans and automated methods. We performed a thorough evaluation of existing synthetic text detection approaches on lyrics, a previously unexplored data type. We also investigated methods to adapt the best-performing features to lyrics through unsupervised domain adaptation. Following both music and industrial constraints, we examined how well these approaches generalize across languages, scale with data availability, handle multilingual language content, and perform on novel genres in few-shot settings. Our findings show promising results that could inform policy decisions around AI-generated music and enhance transparency for users.
2024
Proceedings of the 3rd Workshop on NLP for Music and Audio (NLP4MusA)
Anna Kruspe
|
Sergio Oramas
|
Elena V. Epure
|
Mohamed Sordo
|
Benno Weck
|
SeungHeon Doh
|
Minz Won
|
Ilaria Manco
|
Gabriel Meseguer-Brocal
Proceedings of the 3rd Workshop on NLP for Music and Audio (NLP4MusA)
Harnessing High-Level Song Descriptors towards Natural Language-Based Music Recommendation
Elena V. Epure
|
Gabriel Meseguer Brocal
|
Darius Afchar
|
Romain Hennequin
Proceedings of the 3rd Workshop on NLP for Music and Audio (NLP4MusA)
Recommender systems relying on Language Models (LMs) have gained popularity in assisting users to navigate large catalogs. LMs often exploit item high-level descriptors, i.e. categories or consumption contexts, from training data or user preferences. This has been proven effective in domains like movies or products. In music though, understanding how effectively LMs utilize song descriptors for natural language-based music recommendation is relatively limited. In this paper, we assess LMs effectiveness in recommending songs based on user natural language requests and items with descriptors like genres, moods, and listening contexts. We formulate the recommendation as a dense retrieval problem and assess LMs as they become increasingly familiar with data pertinent to the task and domain. Our findings reveal improved performance as LMs are fine-tuned for general language similarity, information retrieval, and mapping longer descriptions to shorter, high-level descriptors in music.
Search
Fix data
Co-authors
- Elena V. Epure 3
- Darius Afchar 1
- Seungheon Doh 1
- Markus Frohmann 1
- Romain Hennequin 1
- show all...