Abstract
Counterfactuals describe events counter to facts and hence naturally involve common sense, knowledge, and reasoning. SemEval 2020 task 5 is focusing on this field. We participate in the subtask 1 and we use BERT as our system. Our Innovations are feature extraction and data augmentation. We extract and summarize features of counterfactual statements, augment counterfactual examples in training set with the help of these features, and two general methods of data augmentation is experimented in our work. We demonstrate the effectiveness of our approaches, which achieves 0.95 of subtask 1 in F1 while using only a subset of giving training set to fine-tune the BERT model, and our official submission achieves F1 0.802, which ranks us 16th in the competition.- Anthology ID:
- 2020.semeval-1.81
- Volume:
- Proceedings of the Fourteenth Workshop on Semantic Evaluation
- Month:
- December
- Year:
- 2020
- Address:
- Barcelona (online)
- Editors:
- Aurelie Herbelot, Xiaodan Zhu, Alexis Palmer, Nathan Schneider, Jonathan May, Ekaterina Shutova
- Venue:
- SemEval
- SIG:
- SIGLEX
- Publisher:
- International Committee for Computational Linguistics
- Note:
- Pages:
- 633–639
- Language:
- URL:
- https://aclanthology.org/2020.semeval-1.81
- DOI:
- 10.18653/v1/2020.semeval-1.81
- Cite (ACL):
- Chang Liu and Dong Yu. 2020. BLCU-NLP at SemEval-2020 Task 5: Data Augmentation for Efficient Counterfactual Detecting. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 633–639, Barcelona (online). International Committee for Computational Linguistics.
- Cite (Informal):
- BLCU-NLP at SemEval-2020 Task 5: Data Augmentation for Efficient Counterfactual Detecting (Liu & Yu, SemEval 2020)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-3/2020.semeval-1.81.pdf