UTMN at SemEval-2020 Task 11: A Kitchen Solution to Automatic Propaganda Detection
Elena Mikhalkova, Nadezhda Ganzherli, Anna Glazkova, Yuliya Bidulya
Abstract
The article describes a fast solution to propaganda detection at SemEval-2020 Task 11, based on feature adjustment. We use per-token vectorization of features and a simple Logistic Regression classifier to quickly test different hypotheses about our data. We come up with what seems to us the best solution, however, we are unable to align it with the result of the metric suggested by the organizers of the task. We test how our system handles class and feature imbalance by varying the number of samples of two classes (Propaganda and None) in the training set, the size of a context window in which a token is vectorized and combination of vectorization means. The result of our system at SemEval2020 Task 11 is F-score=0.37.- Anthology ID:
- 2020.semeval-1.245
- Volume:
- Proceedings of the Fourteenth Workshop on Semantic Evaluation
- Month:
- December
- Year:
- 2020
- Address:
- Barcelona (online)
- Venue:
- SemEval
- SIGs:
- SIGLEX | SIGSEM
- Publisher:
- International Committee for Computational Linguistics
- Note:
- Pages:
- 1858–1864
- Language:
- URL:
- https://aclanthology.org/2020.semeval-1.245
- DOI:
- 10.18653/v1/2020.semeval-1.245
- Cite (ACL):
- Elena Mikhalkova, Nadezhda Ganzherli, Anna Glazkova, and Yuliya Bidulya. 2020. UTMN at SemEval-2020 Task 11: A Kitchen Solution to Automatic Propaganda Detection. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 1858–1864, Barcelona (online). International Committee for Computational Linguistics.
- Cite (Informal):
- UTMN at SemEval-2020 Task 11: A Kitchen Solution to Automatic Propaganda Detection (Mikhalkova et al., SemEval 2020)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/2020.semeval-1.245.pdf
- Code
- evrog/Propaganda-Detection-at-SemEval-2020