Archaeology at BEA 2025 Shared Task: Are Simple Baselines Good Enough?

Ana Roșu, Jany-Gabriel Ispas, Sergiu Nisioi


Abstract
This paper describes our approach for 5 classification tasks from Building Educational Applications (BEA) 2025 Shared Task.Our methods range from classical machine learning models to large-scale transformers with fine-tuning and prompting strategies. Despite the diversity of approaches, performance differences were often minor, suggesting a strong surface-level signal and the limiting effect of annotation noise—particularly around the “To some extent” label. Under lenient evaluation, simple models perform competitively, showing their effectiveness in low-resource settings. Our submissions ranked in the top 10 in four of five tracks.
Anthology ID:
2025.bea-1.98
Volume:
Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2025)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Ekaterina Kochmar, Bashar Alhafni, Marie Bexte, Jill Burstein, Andrea Horbach, Ronja Laarmann-Quante, Anaïs Tack, Victoria Yaneva, Zheng Yuan
Venues:
BEA | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1224–1241
Language:
URL:
https://preview.aclanthology.org/landing_page/2025.bea-1.98/
DOI:
Bibkey:
Cite (ACL):
Ana Roșu, Jany-Gabriel Ispas, and Sergiu Nisioi. 2025. Archaeology at BEA 2025 Shared Task: Are Simple Baselines Good Enough?. In Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2025), pages 1224–1241, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Archaeology at BEA 2025 Shared Task: Are Simple Baselines Good Enough? (Roșu et al., BEA 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/2025.bea-1.98.pdf