CLaC-BP at SemEval-2021 Task 8: SciBERT Plus Rules for MeasEval

Benjamin Therien, Parsa Bagherzadeh, Sabine Bergler


Abstract
This paper explains the design of a heterogeneous system that ranked eighth in SemEval-2021 Task 8. We analyze ablation experiments and demonstrate how the system components (tokenizer, unit identifier, modifier classifier, and language model) affect the overall score. We compare our results to similar experiments from the literature and introduce a grouping algorithm, developed in the post-evaluation phase, that increased our system’s overall score and would hypothetically have raised our competition rank from eighth to sixth.
Anthology ID:
2021.semeval-1.49
Volume:
Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021)
Month:
August
Year:
2021
Address:
Online
Editors:
Alexis Palmer, Nathan Schneider, Natalie Schluter, Guy Emerson, Aurelie Herbelot, Xiaodan Zhu
Venue:
SemEval
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Pages:
410–415
URL:
https://aclanthology.org/2021.semeval-1.49
DOI:
10.18653/v1/2021.semeval-1.49
Cite (ACL):
Benjamin Therien, Parsa Bagherzadeh, and Sabine Bergler. 2021. CLaC-BP at SemEval-2021 Task 8: SciBERT Plus Rules for MeasEval. In Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021), pages 410–415, Online. Association for Computational Linguistics.
Cite (Informal):
CLaC-BP at SemEval-2021 Task 8: SciBERT Plus Rules for MeasEval (Therien et al., SemEval 2021)
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/2021.semeval-1.49.pdf