Abstract
We present a method for populating fine-grained classes (e.g., “1950s American jazz musicians”) with instances (e.g., Charles Mingus). While state-of-the-art methods tend to treat class labels as single lexical units, the proposed method considers each of the individual modifiers in the class label relative to the head. An evaluation on the task of reconstructing Wikipedia category pages demonstrates a >10 point increase in AUC, over a strong baseline relying on widely-used Hearst patterns.
- Anthology ID:
- P17-1192
- Volume:
- Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
- Month:
- July
- Year:
- 2017
- Address:
- Vancouver, Canada
- Venue:
- ACL
- Publisher:
- Association for Computational Linguistics
- Pages:
- 2099–2109
- URL:
- https://aclanthology.org/P17-1192
- DOI:
- 10.18653/v1/P17-1192
- Cite (ACL):
- Ellie Pavlick and Marius Paşca. 2017. Identifying 1950s American Jazz Musicians: Fine-Grained IsA Extraction via Modifier Composition. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2099–2109, Vancouver, Canada. Association for Computational Linguistics.
- Cite (Informal):
- Identifying 1950s American Jazz Musicians: Fine-Grained IsA Extraction via Modifier Composition (Pavlick & Paşca, ACL 2017)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/P17-1192.pdf