Fossils at SemEval-2025 Task 9: Tasting Loss Functions for Food Hazard Detection in Text Reports

Aman Sinha, Federica Gamba


Abstract
Food hazard detection is an emerging field where NLP solutions are being explored. Despite the recent accessibility of powerful language models, one of the key challenges that still persists is the high class imbalance within datasets, often referred to in the literature as the {textit{long tail problem}}.In this work, we present a study exploring different loss functions borrowed from the field of visual recognition, to tackle long-tailed class imbalance for food hazard detection in text reports. Our submission to SemEval-2025 Task 9 on the Food Hazard Detection Challenge shows how re-weighting mechanism in loss functions prove beneficial in class imbalance scenarios. In particular, we empirically show that class-balanced and focal loss functions outperform all other loss strategies for Subtask 1 and 2 respectively.
Anthology ID:
2025.semeval-1.199
Volume:
Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Sara Rosenthal, Aiala Rosá, Debanjan Ghosh, Marcos Zampieri
Venues:
SemEval | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1515–1521
Language:
URL:
https://preview.aclanthology.org/transition-to-people-yaml/2025.semeval-1.199/
DOI:
Bibkey:
Cite (ACL):
Aman Sinha and Federica Gamba. 2025. Fossils at SemEval-2025 Task 9: Tasting Loss Functions for Food Hazard Detection in Text Reports. In Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), pages 1515–1521, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Fossils at SemEval-2025 Task 9: Tasting Loss Functions for Food Hazard Detection in Text Reports (Sinha & Gamba, SemEval 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/transition-to-people-yaml/2025.semeval-1.199.pdf