Semantic Search in Documents Enriched by LOD-based Annotations

Pavel Smrz, Jan Kouril


Abstract
This paper deals with information retrieval on semantically enriched web-scale document collections. It particularly focuses on web-crawled content in which mentions of entities appearing in Freebase, DBpedia and other Linked Open Data resources have been identified. A special attention is paid to indexing structures and advanced query mechanisms that have been employed into a new semantic retrieval system. Scalability features are discussed together with performance statistics and results of experimental evaluation of presented approaches. Examples given to demonstrate key features of the developed solution correspond to the cultural heritage domain in which the results of our work have been primarily applied.
Anthology ID:
L14-1040
Volume:
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
Month:
May
Year:
2014
Address:
Reykjavik, Iceland
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
3724–3727
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/1058_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Pavel Smrz and Jan Kouril. 2014. Semantic Search in Documents Enriched by LOD-based Annotations. In Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pages 3724–3727, Reykjavik, Iceland. European Language Resources Association (ELRA).
Cite (Informal):
Semantic Search in Documents Enriched by LOD-based Annotations (Smrz & Kouril, LREC 2014)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2014/pdf/1058_Paper.pdf