Abstract
This paper introduces a data augmentation technique for the task of detecting human values. Our approach involves generating additional examples using metadata that describes the labels in the datasets. We evaluated the effectiveness of our method by fine-tuning BERT and RoBERTa models on our augmented dataset and comparing their F1 -scores to those of the non-augmented dataset. We obtained competitive results on both the Main test set and the Nahj al-Balagha test set, ranking 14th and 7th respectively among the participants. We also demonstrate that by incorporating our augmentation technique, the classification performance of BERT and RoBERTa is improved, resulting in an increase of up to 10.1% in their F1-score.- Anthology ID:
- 2023.semeval-1.279
- Volume:
- Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)
- Month:
- July
- Year:
- 2023
- Address:
- Toronto, Canada
- Editors:
- Atul Kr. Ojha, A. Seza Doğruöz, Giovanni Da San Martino, Harish Tayyar Madabushi, Ritesh Kumar, Elisa Sartori
- Venue:
- SemEval
- SIG:
- SIGLEX
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 2033–2038
- Language:
- URL:
- https://aclanthology.org/2023.semeval-1.279
- DOI:
- 10.18653/v1/2023.semeval-1.279
- Cite (ACL):
- Erfan Moosavi Monazzah and Sauleh Eetemadi. 2023. Prodicus at SemEval-2023 Task 4: Enhancing Human Value Detection with Data Augmentation and Fine-Tuned Language Models. In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 2033–2038, Toronto, Canada. Association for Computational Linguistics.
- Cite (Informal):
- Prodicus at SemEval-2023 Task 4: Enhancing Human Value Detection with Data Augmentation and Fine-Tuned Language Models (Moosavi Monazzah & Eetemadi, SemEval 2023)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-5/2023.semeval-1.279.pdf