LIPN at SemEval-2017 Task 10: Filtering Candidate Keyphrases from Scientific Publications with Part-of-Speech Tag Sequences to Train a Sequence Labeling Model

Simon David Hernandez; Davide Buscaldi; Thierry Charnois

doi:10.18653/v1/S17-2174

LIPN at SemEval-2017 Task 10: Filtering Candidate Keyphrases from Scientific Publications with Part-of-Speech Tag Sequences to Train a Sequence Labeling Model

Simon David Hernandez, Davide Buscaldi, Thierry Charnois

Abstract

This paper describes the system used by the team LIPN in SemEval 2017 Task 10: Extracting Keyphrases and Relations from Scientific Publications. The team participated in Scenario 1, that includes three subtasks, Identification of keyphrases (Subtask A), Classification of identified keyphrases (Subtask B) and Extraction of relationships between two identified keyphrases (Subtask C). The presented system was mainly focused on the use of part-of-speech tag sequences to filter candidate keyphrases for Subtask A. Subtasks A and B were addressed as a sequence labeling problem using Conditional Random Fields (CRFs) and even though Subtask C was out of the scope of this approach, one rule was included to identify synonyms.

Anthology ID:: S17-2174
Volume:: Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017)
Month:: August
Year:: 2017
Address:: Vancouver, Canada
Editors:: Steven Bethard, Marine Carpuat, Marianna Apidianaki, Saif M. Mohammad, Daniel Cer, David Jurgens
Venue:: SemEval
SIG:: SIGLEX
Publisher:: Association for Computational Linguistics
Note:
Pages:: 995–999
Language:
URL:: https://preview.aclanthology.org/nschneid-patch-2/S17-2174/
DOI:: 10.18653/v1/S17-2174
Bibkey:
Cite (ACL):: Simon David Hernandez, Davide Buscaldi, and Thierry Charnois. 2017. LIPN at SemEval-2017 Task 10: Filtering Candidate Keyphrases from Scientific Publications with Part-of-Speech Tag Sequences to Train a Sequence Labeling Model. In Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), pages 995–999, Vancouver, Canada. Association for Computational Linguistics.
Cite (Informal):: LIPN at SemEval-2017 Task 10: Filtering Candidate Keyphrases from Scientific Publications with Part-of-Speech Tag Sequences to Train a Sequence Labeling Model (Hernandez et al., SemEval 2017)
Copy Citation:
PDF:: https://preview.aclanthology.org/nschneid-patch-2/S17-2174.pdf

PDF Cite Search Fix data