Prodicus at SemEval-2023 Task 4: Enhancing Human Value Detection with Data Augmentation and Fine-Tuned Language Models

Erfan Moosavi Monazzah, Sauleh Eetemadi


Abstract
This paper introduces a data augmentation technique for the task of detecting human values. Our approach involves generating additional examples using metadata that describes the labels in the datasets. We evaluated the effectiveness of our method by fine-tuning BERT and RoBERTa models on our augmented dataset and comparing their F1 -scores to those of the non-augmented dataset. We obtained competitive results on both the Main test set and the Nahj al-Balagha test set, ranking 14th and 7th respectively among the participants. We also demonstrate that by incorporating our augmentation technique, the classification performance of BERT and RoBERTa is improved, resulting in an increase of up to 10.1% in their F1-score.
Anthology ID:
2023.semeval-1.279
Volume:
Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Atul Kr. Ojha, A. Seza Doğruöz, Giovanni Da San Martino, Harish Tayyar Madabushi, Ritesh Kumar, Elisa Sartori
Venue:
SemEval
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Note:
Pages:
2033–2038
Language:
URL:
https://aclanthology.org/2023.semeval-1.279
DOI:
10.18653/v1/2023.semeval-1.279
Bibkey:
Cite (ACL):
Erfan Moosavi Monazzah and Sauleh Eetemadi. 2023. Prodicus at SemEval-2023 Task 4: Enhancing Human Value Detection with Data Augmentation and Fine-Tuned Language Models. In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 2033–2038, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Prodicus at SemEval-2023 Task 4: Enhancing Human Value Detection with Data Augmentation and Fine-Tuned Language Models (Moosavi Monazzah & Eetemadi, SemEval 2023)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-5/2023.semeval-1.279.pdf