Abstract
This paper presents our relation extraction system for subtask C of SemEval-2017 Task 10: ScienceIE. Assuming that the keyphrases are already annotated in the input data, our work explores a wide range of linguistic features, applies various feature selection techniques, optimizes the hyper parameters and class weights and experiments with different problem formulations (single classification model vs individual classifiers for each keyphrase type, single-step classifier vs pipeline classifier for hyponym relations). Performance of five popular classification algorithms are evaluated for each problem formulation along with feature selection. The best setting achieved an F1 score of 71.0% for synonym and 30.0% for hyponym relation on the test data.- Anthology ID:
- S17-2168
- Volume:
- Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017)
- Month:
- August
- Year:
- 2017
- Address:
- Vancouver, Canada
- Editors:
- Steven Bethard, Marine Carpuat, Marianna Apidianaki, Saif M. Mohammad, Daniel Cer, David Jurgens
- Venue:
- SemEval
- SIG:
- SIGLEX
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 965–968
- Language:
- URL:
- https://aclanthology.org/S17-2168
- DOI:
- 10.18653/v1/S17-2168
- Cite (ACL):
- Biswanath Barik and Erwin Marsi. 2017. NTNU-2 at SemEval-2017 Task 10: Identifying Synonym and Hyponym Relations among Keyphrases in Scientific Documents. In Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), pages 965–968, Vancouver, Canada. Association for Computational Linguistics.
- Cite (Informal):
- NTNU-2 at SemEval-2017 Task 10: Identifying Synonym and Hyponym Relations among Keyphrases in Scientific Documents (Barik & Marsi, SemEval 2017)
- PDF:
- https://preview.aclanthology.org/naacl24-info/S17-2168.pdf
- Data
- SemEval-2010 Task-8, SemEval-2017 Task-10