Demand-Weighted Completeness Prediction for a Knowledge Base

Andrew Hopkinson, Amit Gurdasani, Dave Palfrey, Arpit Mittal


Abstract
In this paper we introduce the notion of Demand-Weighted Completeness, allowing estimation of the completeness of a knowledge base with respect to how it is used. Defining an entity by its classes, we employ usage data to predict the distribution over relations for that entity. For example, instances of person in a knowledge base may require a birth date, name and nationality to be considered complete. These predicted relation distributions enable detection of important gaps in the knowledge base, and define the required facts for unseen entities. Such characterisation of the knowledge base can also quantify how usage and completeness change over time. We demonstrate a method to measure Demand-Weighted Completeness, and show that a simple neural network model performs well at this prediction task.
Anthology ID:
N18-3025
Volume:
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 3 (Industry Papers)
Month:
June
Year:
2018
Address:
New Orleans - Louisiana
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
200–207
Language:
URL:
https://aclanthology.org/N18-3025
DOI:
10.18653/v1/N18-3025
Bibkey:
Cite (ACL):
Andrew Hopkinson, Amit Gurdasani, Dave Palfrey, and Arpit Mittal. 2018. Demand-Weighted Completeness Prediction for a Knowledge Base. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 3 (Industry Papers), pages 200–207, New Orleans - Louisiana. Association for Computational Linguistics.
Cite (Informal):
Demand-Weighted Completeness Prediction for a Knowledge Base (Hopkinson et al., NAACL 2018)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/N18-3025.pdf
Video:
 http://vimeo.com/277674060