LACA: Improving Cross-lingual Aspect-Based Sentiment Analysis with LLM Data Augmentation

Jakub Šmíd; Pavel Přibáň; Pavel Král

LACA: Improving Cross-lingual Aspect-Based Sentiment Analysis with LLM Data Augmentation

Abstract

Cross-lingual aspect-based sentiment analysis (ABSA) involves detailed sentiment analysis in a target language by transferring knowledge from a source language with available annotated data. Most existing methods depend heavily on often unreliable translation tools to bridge the language gap. In this paper, we propose a new approach that leverages a large language model (LLM) to generate high-quality pseudo-labelled data in the target language without the need for translation tools. First, the framework trains an ABSA model to obtain predictions for unlabelled target language data. Next, LLM is prompted to generate natural sentences that better represent these noisy predictions than the original text. The ABSA model is then further fine-tuned on the resulting pseudo-labelled dataset. We demonstrate the effectiveness of this method across six languages and five backbone models, surpassing previous state-of-the-art translation-based approaches. The proposed framework also supports generative models, and we show that fine-tuned LLMs outperform smaller multilingual models.

Anthology ID:: 2025.acl-long.41
Volume:: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 839–853
Language:
URL:: https://preview.aclanthology.org/ingestion-acl-25/2025.acl-long.41/
DOI:
Bibkey:
Cite (ACL):: Jakub Šmíd, Pavel Priban, and Pavel Kral. 2025. LACA: Improving Cross-lingual Aspect-Based Sentiment Analysis with LLM Data Augmentation. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 839–853, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: LACA: Improving Cross-lingual Aspect-Based Sentiment Analysis with LLM Data Augmentation (Šmíd et al., ACL 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingestion-acl-25/2025.acl-long.41.pdf

PDF Cite Search Fix data