Zareen Syed
2010
Unsupervised techniques for discovering ontology elements from Wikipedia article links
Zareen Syed
|
Tim Finin
Proceedings of the NAACL HLT 2010 First International Workshop on Formalisms and Methodology for Learning by Reading
A Hybrid Approach to Unsupervised Relation Discovery Based on Linguistic Analysis and Semantic Typing
Zareen Syed
|
Evelyne Viegas
Proceedings of the NAACL HLT 2010 First International Workshop on Formalisms and Methodology for Learning by Reading
Automatic Discovery of Semantic Relations using MindNet
Zareen Syed
|
Evelyne Viegas
|
Savas Parastatidis
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)
Information extraction deals with extracting entities (such as people, organizations or locations) and named relations between entities (such as ""People born-in Country"") from text documents. An important challenge in information extraction is the labeling of training data which is usually done manually and is therefore very laborious and in certain cases impractical. This paper introduces a new model to extract semantic relations fully automatically from text using the Encarta encyclopedia and lexical-semantic relations discovered by MindNet. MindNet is a lexical knowledge base that can be constructed fully automatically from a given text corpus without any human intervention. Encarta articles are categorized and linked to related articles by experts. We demonstrate how the structured data available in Encarta and the lexical semantic relations between words in MindNet can be used to enrich MindNet with semantic relations between entities. With a slight trade off of accuracy a semantically enriched MindNet can be used to extract relations from a text corpus without any human intervention.
Search