SINAI at SemEval-2020 Task 12: Offensive Language Identification Exploring Transfer Learning Models
Flor Miriam Plaza del Arco, M. Dolores Molina González, Alfonso Ureña-López, Maite Martin
Abstract
This paper describes the participation of SINAI team at Task 12: OffensEval 2: Multilingual Offensive Language Identification in Social Media. In particular, the participation in Sub-task A in English which consists of identifying tweets as offensive or not offensive. We preprocess the dataset according to the language characteristics used on social media. Then, we select a small set from the training set provided by the organizers and fine-tune different Transformerbased models in order to test their effectiveness. Our team ranks 20th out of 85 participants in Subtask-A using the XLNet model.- Anthology ID:
- 2020.semeval-1.211
- Volume:
- Proceedings of the Fourteenth Workshop on Semantic Evaluation
- Month:
- December
- Year:
- 2020
- Address:
- Barcelona (online)
- Editors:
- Aurelie Herbelot, Xiaodan Zhu, Alexis Palmer, Nathan Schneider, Jonathan May, Ekaterina Shutova
- Venue:
- SemEval
- SIG:
- SIGLEX
- Publisher:
- International Committee for Computational Linguistics
- Note:
- Pages:
- 1622–1627
- Language:
- URL:
- https://aclanthology.org/2020.semeval-1.211
- DOI:
- 10.18653/v1/2020.semeval-1.211
- Cite (ACL):
- Flor Miriam Plaza del Arco, M. Dolores Molina González, Alfonso Ureña-López, and Maite Martin. 2020. SINAI at SemEval-2020 Task 12: Offensive Language Identification Exploring Transfer Learning Models. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 1622–1627, Barcelona (online). International Committee for Computational Linguistics.
- Cite (Informal):
- SINAI at SemEval-2020 Task 12: Offensive Language Identification Exploring Transfer Learning Models (Plaza del Arco et al., SemEval 2020)
- PDF:
- https://preview.aclanthology.org/emnlp22-frontmatter/2020.semeval-1.211.pdf