Analyzing Gambling Addictions: A Spanish Corpus for Understanding Pathological Behavior

Manuel Couto, Marcos Fernández-Pichel, Mario Ezra Aragon, David E. Losada


Abstract
This work fosters research on the interaction between natural language use and gambling disorders. We have built a new Spanish corpus for screening standardized gambling symptoms. We employ search methods to find on-topic sentences, top-k pooling to form the assessment pools of sentences, and thorough annotation guidelines. The labeling task is challenging, given the need to identify topic relevance and explicit evidence about the symptoms. Additionally, we explore using state-of-the-art LLMs for annotation and compare different sentence search models.
Anthology ID:
2025.findings-emnlp.955
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2025
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
17610–17619
Language:
URL:
https://preview.aclanthology.org/ingest-luhme/2025.findings-emnlp.955/
DOI:
10.18653/v1/2025.findings-emnlp.955
Bibkey:
Cite (ACL):
Manuel Couto, Marcos Fernández-Pichel, Mario Ezra Aragon, and David E. Losada. 2025. Analyzing Gambling Addictions: A Spanish Corpus for Understanding Pathological Behavior. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 17610–17619, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
Analyzing Gambling Addictions: A Spanish Corpus for Understanding Pathological Behavior (Couto et al., Findings 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-luhme/2025.findings-emnlp.955.pdf
Checklist:
 2025.findings-emnlp.955.checklist.pdf