Metadata Shaping: A Simple Approach for Knowledge-Enhanced Language Models

Simran Arora, Sen Wu, Enci Liu, Christopher Re


Abstract
Popular language models (LMs) struggle to capture knowledge about rare tail facts and entities. Since widely used systems such as search and personal-assistants must support the long tail of entities that users ask about, there has been significant effort towards enhancing these base LMs with factual knowledge. We observe proposed methods typically start with a base LM and data that has been annotated with entity metadata, then change the model, by modifying the architecture or introducing auxiliary loss terms to better capture entity knowledge. In this work, we question this typical process and ask to what extent can we match the quality of model modifications, with a simple alternative: using a base LM and only changing the data. We propose metadata shaping, a method which inserts substrings corresponding to the readily available entity metadata, e.g. types and descriptions, into examples at train and inference time based on mutual information. Despite its simplicity, metadata shaping is quite effective. On standard evaluation benchmarks for knowledge-enhanced LMs, the method exceeds the base-LM baseline by an average of 4.3 F1 points and achieves state-of-the-art results. We further show the gains are on average 4.4x larger for the slice of examples containing tail vs. popular entities.
Anthology ID:
2022.findings-acl.137
Volume:
Findings of the Association for Computational Linguistics: ACL 2022
Month:
May
Year:
2022
Address:
Dublin, Ireland
Editors:
Smaranda Muresan, Preslav Nakov, Aline Villavicencio
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1733–1745
Language:
URL:
https://aclanthology.org/2022.findings-acl.137
DOI:
10.18653/v1/2022.findings-acl.137
Bibkey:
Cite (ACL):
Simran Arora, Sen Wu, Enci Liu, and Christopher Re. 2022. Metadata Shaping: A Simple Approach for Knowledge-Enhanced Language Models. In Findings of the Association for Computational Linguistics: ACL 2022, pages 1733–1745, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):
Metadata Shaping: A Simple Approach for Knowledge-Enhanced Language Models (Arora et al., Findings 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/naacl24-info/2022.findings-acl.137.pdf
Video:
 https://preview.aclanthology.org/naacl24-info/2022.findings-acl.137.mp4
Code
 simran-arora/metadatashaping
Data
FewRelOpen EntityTACRED