Terry Lima Ruas


2025

pdf bib
Citation Amnesia: On The Recency Bias of NLP and Other Academic Fields
Jan Philip Wahle | Terry Lima Ruas | Mohamed Abdalla | Bela Gipp | Saif M. Mohammad
Proceedings of the 31st International Conference on Computational Linguistics

This study examines the tendency to cite older work across 20 fields of study over 43 years (1980–2023). We put NLP’s propensity to cite older work in the context of these 20 other fields to analyze whether NLP shows similar temporal citation patterns to them over time or whether differences can be observed. Our analysis, based on a dataset of ~240 million papers, reveals a broader scientific trend: many fields have markedly declined in citing older works (e.g., psychology, computer science). The trend is strongest in NLP and ML research (-12.8% and -5.5% in citation age from previous peaks). Our results suggest that citing more recent works is not directly driven by the growth in publication rates (-3.4% across fields; -5.2% in humanities; -5.5% in formal sciences) — even when controlling for an increase in the volume of papers. Our findings raise questions about the scientific community’s engagement with past literature, particularly for NLP, and the potential consequences of neglecting older but relevant research. The data and a demo showcasing our results are publicly available.

pdf bib
SemEval-2025 Task 11: Bridging the Gap in Text-Based Emotion Detection
Shamsuddeen Hassan Muhammad | Nedjma Ousidhoum | Idris Abdulmumin | Seid Muhie Yimam | Jan Philip Wahle | Terry Lima Ruas | Meriem Beloucif | Christine De Kock | Tadesse Destaw Belay | Ibrahim Said Ahmad | Nirmal Surange | Daniela Teodorescu | David Ifeoluwa Adelani | Alham Fikri Aji | Felermino Dario Mario Ali | Vladimir Araujo | Abinew Ali Ayele | Oana Ignat | Alexander Panchenko | Yi Zhou | Saif Mohammad
Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)

We present our shared task on text-based emotion detection, covering more than 30 languages from seven distinct language families. These languages are predominantly low-resource and spoken across various continents. The data instances are multi-labeled into six emotional classes, with additional datasets in 11 languages annotated for emotion intensity. Participants were asked to predict labels in three tracks: (a) emotion labels in monolingual settings, (b) emotion intensity scores, and (c) emotion labels in cross-lingual settings.