MyMy at SemEval-2025 Task 9: A Robust Knowledge-Augmented Data Approach for Reliable Food Hazard Detection

Ben Phan, Jung - Hsien Chiang


Abstract
Food hazard detection from web sources, including social media and official food agency websites, is crucial for mitigating economic and public health risks. However, challenges such as class imbalance and the need for transparent, explainable AI remain. To address these issues, we propose a Knowledge-Augmented Data approach using Retrieval-Augmented Generation (RAG) to improve food incident report classification in SemEval-2025 Task 9. Our method leverages domain-specific knowledge to enrich datasets and curate high-quality data, enhancing overall integrity. We hypothesize that knowledge-augmented data improves Macro-F1 scores, the primary evaluation metric. Our approach achieved a top-3 average ranking across both subtasks, demonstrating its effectiveness in advancing NLP applications for food safety and contributing to more reliable food hazard detection systems.
Anthology ID:
2025.semeval-1.112
Volume:
Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Sara Rosenthal, Aiala Rosá, Debanjan Ghosh, Marcos Zampieri
Venues:
SemEval | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
812–822
Language:
URL:
https://preview.aclanthology.org/corrections-2025-08/2025.semeval-1.112/
DOI:
Bibkey:
Cite (ACL):
Ben Phan and Jung - Hsien Chiang. 2025. MyMy at SemEval-2025 Task 9: A Robust Knowledge-Augmented Data Approach for Reliable Food Hazard Detection. In Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), pages 812–822, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
MyMy at SemEval-2025 Task 9: A Robust Knowledge-Augmented Data Approach for Reliable Food Hazard Detection (Phan & Chiang, SemEval 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/corrections-2025-08/2025.semeval-1.112.pdf