Multi-Domain Targeted Sentiment Analysis

Orith Toledo-Ronen, Matan Orbach, Yoav Katz, Noam Slonim


Abstract
Targeted Sentiment Analysis (TSA) is a central task for generating insights from consumer reviews. Such content is extremely diverse, with sites like Amazon or Yelp containing reviews on products and businesses from many different domains. A real-world TSA system should gracefully handle that diversity. This can be achieved by a multi-domain model – one that is robust to the domain of the analyzed texts, and performs well on various domains. To address this scenario, we present a multi-domain TSA system based on augmenting a given training set with diverse weak labels from assorted domains. These are obtained through self-training on the Yelp reviews corpus. Extensive experiments with our approach on three evaluation datasets across different domains demonstrate the effectiveness of our solution. We further analyze how restrictions imposed on the available labeled data affect the performance, and compare the proposed method to the costly alternative of manually gathering diverse TSA labeled data. Our results and analysis show that our approach is a promising step towards a practical domain-robust TSA system.
Anthology ID:
2022.naacl-main.198
Volume:
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Month:
July
Year:
2022
Address:
Seattle, United States
Editors:
Marine Carpuat, Marie-Catherine de Marneffe, Ivan Vladimir Meza Ruiz
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
2751–2762
Language:
URL:
https://aclanthology.org/2022.naacl-main.198
DOI:
10.18653/v1/2022.naacl-main.198
Bibkey:
Cite (ACL):
Orith Toledo-Ronen, Matan Orbach, Yoav Katz, and Noam Slonim. 2022. Multi-Domain Targeted Sentiment Analysis. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 2751–2762, Seattle, United States. Association for Computational Linguistics.
Cite (Informal):
Multi-Domain Targeted Sentiment Analysis (Toledo-Ronen et al., NAACL 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/nschneid-patch-5/2022.naacl-main.198.pdf
Video:
 https://preview.aclanthology.org/nschneid-patch-5/2022.naacl-main.198.mp4
Data
MAMSYASO