CoT-based Data Augmentation Strategy for Persuasion Techniques Detection
Dailin Li, Chuhan Wang, Xin Zou, Junlong Wang, Peng Chen, Jian Wang, Liang Yang, Hongfei Lin
Abstract
Detecting persuasive communication is an important topic in Natural Language Processing (NLP), as it can be useful in identifying fake information on social media. We have developed a system to identify applied persuasion techniques in text fragments across four languages: English, Bulgarian, North Macedonian, and Arabic. Our system uses data augmentation methods and employs an ensemble strategy that combines the strengths of both RoBERTa and DeBERTa models. Due to limited resources, we concentrated solely on task 1, and our solution achieved the top ranking in the English track during the official assessments. We also analyse the impact of architectural decisions, data constructionand training strategies.- Anthology ID:
- 2024.semeval-1.190
- Volume:
- Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024)
- Month:
- June
- Year:
- 2024
- Address:
- Mexico City, Mexico
- Editors:
- Atul Kr. Ojha, A. Seza Doğruöz, Harish Tayyar Madabushi, Giovanni Da San Martino, Sara Rosenthal, Aiala Rosá
- Venue:
- SemEval
- SIG:
- SIGLEX
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 1315–1321
- Language:
- URL:
- https://aclanthology.org/2024.semeval-1.190
- DOI:
- 10.18653/v1/2024.semeval-1.190
- Cite (ACL):
- Dailin Li, Chuhan Wang, Xin Zou, Junlong Wang, Peng Chen, Jian Wang, Liang Yang, and Hongfei Lin. 2024. CoT-based Data Augmentation Strategy for Persuasion Techniques Detection. In Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval-2024), pages 1315–1321, Mexico City, Mexico. Association for Computational Linguistics.
- Cite (Informal):
- CoT-based Data Augmentation Strategy for Persuasion Techniques Detection (Li et al., SemEval 2024)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-4/2024.semeval-1.190.pdf