Distilling Salient Reviews with Zero Labels

Chieh-Yang Huang, Jinfeng Li, Nikita Bhutani, Alexander Whedon, Estevam Hruschka, Yoshi Suhara


Abstract
Many people read online reviews to learn about real-world entities of interest. However, the majority of reviews describe only the general experiences and opinions of customers, and may not reveal facts that are specific to the entity being reviewed. In this work, we focus on a novel task of mining from a review corpus sentences that are unique to each entity. We refer to this task as Salient Fact Extraction. Salient facts are extremely scarce due to their very nature. Consequently, collecting labeled examples for training supervised models is tedious and cost-prohibitive. To alleviate this scarcity problem, we develop an unsupervised method, ZL-Distiller, which leverages contextual language representations of the reviews and their distributional patterns to identify salient sentences about entities. Our experiments on multiple domains (hotels, products, and restaurants) show that ZL-Distiller achieves state-of-the-art performance and further boosts the performance of other supervised/unsupervised algorithms for the task. Furthermore, we show that salient sentences mined by ZL-Distiller provide unique and detailed information about entities, which benefits downstream NLP applications including question answering and summarization.
Anthology ID:
2022.fever-1.3
Volume:
Proceedings of the Fifth Fact Extraction and VERification Workshop (FEVER)
Month:
May
Year:
2022
Address:
Dublin, Ireland
Editors:
Rami Aly, Christos Christodoulopoulos, Oana Cocarascu, Zhijiang Guo, Arpit Mittal, Michael Schlichtkrull, James Thorne, Andreas Vlachos
Venue:
FEVER
Publisher:
Association for Computational Linguistics
Pages:
16–28
URL:
https://aclanthology.org/2022.fever-1.3
DOI:
10.18653/v1/2022.fever-1.3
Cite (ACL):
Chieh-Yang Huang, Jinfeng Li, Nikita Bhutani, Alexander Whedon, Estevam Hruschka, and Yoshi Suhara. 2022. Distilling Salient Reviews with Zero Labels. In Proceedings of the Fifth Fact Extraction and VERification Workshop (FEVER), pages 16–28, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):
Distilling Salient Reviews with Zero Labels (Huang et al., FEVER 2022)
PDF:
https://preview.aclanthology.org/naacl-24-ws-corrections/2022.fever-1.3.pdf
Video:
https://preview.aclanthology.org/naacl-24-ws-corrections/2022.fever-1.3.mp4