Abstract
We present a neural framework for opinion summarization from online product reviews which is knowledge-lean and only requires light supervision (e.g., in the form of product domain labels and user-provided ratings). Our method combines two weakly supervised components to identify salient opinions and form extractive summaries from multiple reviews: an aspect extractor trained under a multi-task objective, and a sentiment predictor based on multiple instance learning. We introduce an opinion summarization dataset that includes a training set of product reviews from six diverse domains and human-annotated development and test sets with gold standard aspect annotations, salience labels, and opinion summaries. Automatic evaluation shows significant improvements over baselines, and a large-scale study indicates that our opinion summaries are preferred by human judges according to multiple criteria.- Anthology ID:
- D18-1403
- Volume:
- Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
- Month:
- October-November
- Year:
- 2018
- Address:
- Brussels, Belgium
- Venue:
- EMNLP
- SIG:
- SIGDAT
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 3675–3686
- Language:
- URL:
- https://aclanthology.org/D18-1403
- DOI:
- 10.18653/v1/D18-1403
- Cite (ACL):
- Stefanos Angelidis and Mirella Lapata. 2018. Summarizing Opinions: Aspect Extraction Meets Sentiment Prediction and They Are Both Weakly Supervised. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 3675–3686, Brussels, Belgium. Association for Computational Linguistics.
- Cite (Informal):
- Summarizing Opinions: Aspect Extraction Meets Sentiment Prediction and They Are Both Weakly Supervised (Angelidis & Lapata, EMNLP 2018)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/D18-1403.pdf
- Code
- stangelid/oposum + additional community code
- Data
- OpoSum