ECNUICA at SemEval-2021 Task 11: Rule based Information Extraction Pipeline

Jiaju Lin, Jing Ling, Zhiwei Wang, Jiawei Liu, Qin Chen, Liang He


Abstract
This paper presents our endeavor for solving task11, NLPContributionGraph, of SemEval-2021. The purpose of the task was to extract triples from a paper in the Nature Language Processing field for constructing an Open Research Knowledge Graph. The task includes three sub-tasks: detecting the contribution sentences in papers, identifying scientific terms and predicate phrases from the contribution sentences; and inferring triples in the form of (subject, predicate, object) as statements for Knowledge Graph building. In this paper, we apply an ensemble of various fine-tuned pre-trained language models (PLM) for tasks one and two. In addition, self-training methods are adopted for tackling the shortage of annotated data. For the third task, rather than using classic neural open information extraction (OIE) architectures, we generate potential triples via manually designed rules and develop a binary classifier to differentiate positive ones from others. The quantitative results show that we obtain the 4th, 2nd, and 2nd rank in three evaluation phases.
Anthology ID:
2021.semeval-1.185
Volume:
Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021)
Month:
August
Year:
2021
Address:
Online
Editors:
Alexis Palmer, Nathan Schneider, Natalie Schluter, Guy Emerson, Aurelie Herbelot, Xiaodan Zhu
Venue:
SemEval
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Note:
Pages:
1295–1302
Language:
URL:
https://aclanthology.org/2021.semeval-1.185
DOI:
10.18653/v1/2021.semeval-1.185
Bibkey:
Cite (ACL):
Jiaju Lin, Jing Ling, Zhiwei Wang, Jiawei Liu, Qin Chen, and Liang He. 2021. ECNUICA at SemEval-2021 Task 11: Rule based Information Extraction Pipeline. In Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021), pages 1295–1302, Online. Association for Computational Linguistics.
Cite (Informal):
ECNUICA at SemEval-2021 Task 11: Rule based Information Extraction Pipeline (Lin et al., SemEval 2021)
Copy Citation:
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/2021.semeval-1.185.pdf
Data
SemEval-2021 Task-11