StanceAttack: Adversarial Attack for Stance Detection

Chenye Zhao, Cornelia Caragea


Abstract
Stance detection aims to ascertain whether an author’s text is in favor, against, or neutral toward specific targets like public policies or social issues. While pretrained language models (PLMs) have greatly enhanced stance detection, they remain vulnerable to adversarial attacks—manipulations that maintain textual semantics but lead to incorrect predictions. Such vulnerabilities remain underexplored for stance detection. In this study, we introduce StanceAttack, an innovative adversarial attack method leveraging ChatGPT to create adversarial examples that can mislead well-trained stance detection models. We conduct experiments to evaluate our attack model by attacking state-of-the-art PLMs on two benchmark datasets. Results demonstrate that StanceAttack outperforms traditional adversarial methods with higher success rates and fewer retries. Human evaluations confirm that our adversarial examples preserve the original semantic meanings and naturalness. We share our code and data in https://github.com/chenyez/StanceAttack.
Anthology ID:
2026.findings-acl.2034
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
40954–40971
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.2034/
DOI:
Bibkey:
Cite (ACL):
Chenye Zhao and Cornelia Caragea. 2026. StanceAttack: Adversarial Attack for Stance Detection. In Findings of the Association for Computational Linguistics: ACL 2026, pages 40954–40971, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
StanceAttack: Adversarial Attack for Stance Detection (Zhao & Caragea, Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.2034.pdf
Checklist:
 2026.findings-acl.2034.checklist.pdf