Abstract
Sentiment analysis in low-resource languages suffers from a lack of annotated corpora to estimate high-performing models. Machine translation and bilingual word embeddings provide some relief through cross-lingual sentiment approaches. However, they either require large amounts of parallel data or do not sufficiently capture sentiment information. We introduce Bilingual Sentiment Embeddings (BLSE), which jointly represent sentiment information in a source and target language. This model only requires a small bilingual lexicon, a source-language corpus annotated for sentiment, and monolingual word embeddings for each language. We perform experiments on three language combinations (Spanish, Catalan, Basque) for sentence-level cross-lingual sentiment classification and find that our model significantly outperforms state-of-the-art methods on four out of six experimental setups, as well as capturing complementary information to machine translation. Our analysis of the resulting embedding space provides evidence that it represents sentiment information in the resource-poor target language without any annotated data in that language.- Anthology ID:
- P18-1231
- Volume:
- Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
- Month:
- July
- Year:
- 2018
- Address:
- Melbourne, Australia
- Venue:
- ACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 2483–2493
- Language:
- URL:
- https://aclanthology.org/P18-1231
- DOI:
- 10.18653/v1/P18-1231
- Cite (ACL):
- Jeremy Barnes, Roman Klinger, and Sabine Schulte im Walde. 2018. Bilingual Sentiment Embeddings: Joint Projection of Sentiment Across Languages. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2483–2493, Melbourne, Australia. Association for Computational Linguistics.
- Cite (Informal):
- Bilingual Sentiment Embeddings: Joint Projection of Sentiment Across Languages (Barnes et al., ACL 2018)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/P18-1231.pdf
- Code
- jbarnesspain/blse
- Data
- MultiBooked