Sentiment as an Ordinal Latent Variable

Niklas Stoehr, Ryan Cotterell, Aaron Schein


Abstract
Sentiment analysis has become a central tool in various disciplines outside of natural language processing. In particular in applied and domain-specific settings with strong requirements for interpretable methods, dictionary-based approaches are still a popular choice. However, existing dictionaries are often limited in coverage, static once annotation is completed and sentiment scales differ widely; some are discrete others continuous. We propose a Bayesian generative model that learns a composite sentiment dictionary as an interpolation between six existing dictionaries with different scales. We argue that sentiment is a latent concept with intrinsically ranking-based characteristics — the word “excellent” may be ranked more positive than “great” and “okay”, but it is hard to express how much more exactly. This prompts us to enforce an ordinal scale of ordered discrete sentiment values in our dictionary. We achieve this through an ordering transformation in the priors of our model. We evaluate the model intrinsically by imputing missing values in existing dictionaries. Moreover, we conduct extrinsic evaluations through sentiment classification tasks. Finally, we present two extension: first, we present a method to augment dictionary-based approaches with word embeddings to construct sentiment scales along new semantic axes. Second, we demonstrate a Latent Dirichlet Allocation-inspired variant of our model that learns document topics that are ordered by sentiment.
Anthology ID:
2023.eacl-main.8
Volume:
Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics
Month:
May
Year:
2023
Address:
Dubrovnik, Croatia
Editors:
Andreas Vlachos, Isabelle Augenstein
Venue:
EACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
103–115
Language:
URL:
https://preview.aclanthology.org/icon-24-ingestion/2023.eacl-main.8/
DOI:
10.18653/v1/2023.eacl-main.8
Bibkey:
Cite (ACL):
Niklas Stoehr, Ryan Cotterell, and Aaron Schein. 2023. Sentiment as an Ordinal Latent Variable. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, pages 103–115, Dubrovnik, Croatia. Association for Computational Linguistics.
Cite (Informal):
Sentiment as an Ordinal Latent Variable (Stoehr et al., EACL 2023)
Copy Citation:
PDF:
https://preview.aclanthology.org/icon-24-ingestion/2023.eacl-main.8.pdf
Video:
 https://preview.aclanthology.org/icon-24-ingestion/2023.eacl-main.8.mp4