Challenges in the Knowledge Base Population Slot Filling Task

Bonan Min, Ralph Grishman


Abstract
The Knowledge Based Population (KBP) evaluation track of the Text Analysis Conferences (TAC) has been held for the past 3 years. One of the two tasks of KBP is slot filling: finding within a large corpus the values of a set of attributes of given people and organizations. This task has proven very challenging, with top systems rarely exceeding 30% F-measure. In this paper, we present an error analysis and classification for those answers which could be found by a manual corpus search but were not found by any of the systems participating in the 2010 evaluation. The most common sources of failure were limitations on inference, errors in coreference (particularly with nominal anaphors), and errors in named entity recognition. We relate the types of errors to the characteristics of the task and show the wide diversity of problems that must be addressed to improve overall performance.
Anthology ID:
L12-1104
Volume:
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Month:
May
Year:
2012
Address:
Istanbul, Turkey
Venue:
LREC
SIG:
Publisher:
European Language Resources Association (ELRA)
Note:
Pages:
1137–1142
Language:
URL:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/256_Paper.pdf
DOI:
Bibkey:
Cite (ACL):
Bonan Min and Ralph Grishman. 2012. Challenges in the Knowledge Base Population Slot Filling Task. In Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), pages 1137–1142, Istanbul, Turkey. European Language Resources Association (ELRA).
Cite (Informal):
Challenges in the Knowledge Base Population Slot Filling Task (Min & Grishman, LREC 2012)
Copy Citation:
PDF:
http://www.lrec-conf.org/proceedings/lrec2012/pdf/256_Paper.pdf