INNOVATORS at SemEval-2021 Task-11: A Dependency Parsing and BERT-based model for Extracting Contribution Knowledge from Scientific Papers
Hardik Arora, Tirthankar Ghosal, Sandeep Kumar, Suraj Patwal, Phil Gooch
Abstract
In this work, we describe our system submission to the SemEval 2021 Task 11: NLP Contribution Graph Challenge. We attempt all the three sub-tasks in the challenge and report our results. Subtask 1 aims to identify the contributing sentences in a given publication. Subtask 2 follows from Subtask 1 to extract the scientific term and predicate phrases from the identified contributing sentences. The final Subtask 3 entails extracting triples (subject, predicate, object) from the phrases and categorizing them under one or more defined information units. With the NLPContributionGraph Shared Task, the organizers formalized the building of a scholarly contributions-focused graph over NLP scholarly articles as an automated task. Our approaches include a BERT-based classification model for identifying the contributing sentences in a research publication, a rule-based dependency parsing for phrase extraction, followed by a CNN-based model for information units classification, and a set of rules for triples extraction. The quantitative results show that we obtain the 5th, 5th, and 7th rank respectively in three evaluation phases. We make our codes available at https://github.com/HardikArora17/SemEval-2021-INNOVATORS.- Anthology ID:
- 2021.semeval-1.61
- Volume:
- Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021)
- Month:
- August
- Year:
- 2021
- Address:
- Online
- Editors:
- Alexis Palmer, Nathan Schneider, Natalie Schluter, Guy Emerson, Aurelie Herbelot, Xiaodan Zhu
- Venue:
- SemEval
- SIG:
- SIGLEX
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 502–510
- Language:
- URL:
- https://aclanthology.org/2021.semeval-1.61
- DOI:
- 10.18653/v1/2021.semeval-1.61
- Cite (ACL):
- Hardik Arora, Tirthankar Ghosal, Sandeep Kumar, Suraj Patwal, and Phil Gooch. 2021. INNOVATORS at SemEval-2021 Task-11: A Dependency Parsing and BERT-based model for Extracting Contribution Knowledge from Scientific Papers. In Proceedings of the 15th International Workshop on Semantic Evaluation (SemEval-2021), pages 502–510, Online. Association for Computational Linguistics.
- Cite (Informal):
- INNOVATORS at SemEval-2021 Task-11: A Dependency Parsing and BERT-based model for Extracting Contribution Knowledge from Scientific Papers (Arora et al., SemEval 2021)
- PDF:
- https://preview.aclanthology.org/ingest-acl-2023-videos/2021.semeval-1.61.pdf
- Data
- SemEval-2021 Task-11