Boosting Open Information Extraction with Noun-Based Relations

Clarissa Xavier, Vera Lima


Abstract
Open Information Extraction (Open IE) is a strategy for learning relations from texts, regardless the domain and without predefining these relations. Work in this area has focused mainly on verbal relations. In order to extend Open IE to extract relationships that are not expressed by verbs, we present a novel Open IE approach that extracts relations expressed in noun compounds (NCs), such as (oil, extracted from, olive) from “olive oil”, or in adjective-noun pairs (ANs), such as (moon, that is, gorgeous) from “gorgeous moon”. The approach consists of three steps: detection of NCs and ANs, interpretation of these compounds in view of corpus enrichment and extraction of relations from the enriched corpus. To confirm the feasibility of this method we created a prototype and evaluated the impact of the application of our proposal in two state-of-the-art Open IE extractors. Based on these tests we conclude that the proposed approach is an important step to fulfil the gap concerning the extraction of relations within the noun compounds and adjective-noun pairs in Open IE.
Anthology ID:
L14-1157
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
96–100
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/125_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Clarissa Xavier and Vera Lima. 2014. Boosting Open Information Extraction with Noun-Based Relations. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 96–100, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Boosting Open Information Extraction with Noun-Based Relations (Xavier & Lima, LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/125_Paper.pdf