NTA at SemEval-2025 Task 11: Enhanced Multilingual Textual Multi-label Emotion Detection via Integrated Augmentation Learning

Nguyen Pham Hoang Le, An Nguyen Tran Khuong, Tram Nguyen Thi Ngoc, Thin Dang Van


Abstract
Emotion detection in text is crucial for various applications, but progress, especially in multi-label scenarios, is often hampered by data scarcity, particularly for low-resource languages like Emakhuwa and Tigrinya. This lack of data limits model performance and generalizability. To address this, the NTA team developed a system for SemEval-2025 Task 11, leveraging data augmentation techniques: swap, deletion, oversampling, emotion-focused synonym insertion and synonym replacement to enhance baseline models for multilingual textual multi-label emotion detection. Our proposed system achieved significantly higher macro F1-scores compared to the baseline across multiple languages.
Anthology ID:
2025.semeval-1.237
Volume:
Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Sara Rosenthal, Aiala Rosá, Debanjan Ghosh, Marcos Zampieri
Venues:
SemEval | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1795–1801
Language:
URL:
https://preview.aclanthology.org/transition-to-people-yaml/2025.semeval-1.237/
DOI:
Bibkey:
Cite (ACL):
Nguyen Pham Hoang Le, An Nguyen Tran Khuong, Tram Nguyen Thi Ngoc, and Thin Dang Van. 2025. NTA at SemEval-2025 Task 11: Enhanced Multilingual Textual Multi-label Emotion Detection via Integrated Augmentation Learning. In Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), pages 1795–1801, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
NTA at SemEval-2025 Task 11: Enhanced Multilingual Textual Multi-label Emotion Detection via Integrated Augmentation Learning (Pham Hoang Le et al., SemEval 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/transition-to-people-yaml/2025.semeval-1.237.pdf