PivotAttack: Rethinking the Search Trajectory in Hard-Label Text Attacks via Pivot Words

Yuzhi Liang, Shiliang Xiao, Jingsong Wei, Qiliang Lin, Xia Li


Abstract
Existing hard-label text attacks often rely on inefficient "outside-in" strategies that traverse vast search spaces. We propose PivotAttack, a query-efficient "inside-out" framework. It employs a Multi-Armed Bandit algorithm to identify Pivot Sets—combinatorial token groups acting as prediction anchors—and strategically perturbs them to induce label flips. This approach captures inter-word dependencies and minimizes query costs. Extensive experiments across traditional models and Large Language Models demonstrate that PivotAttack consistently outperforms state-of-the-art baselines in both Attack Success Rate and query efficiency.
Anthology ID:
2026.findings-acl.157
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
3190–3207
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.157/
DOI:
Bibkey:
Cite (ACL):
Yuzhi Liang, Shiliang Xiao, Jingsong Wei, Qiliang Lin, and Xia Li. 2026. PivotAttack: Rethinking the Search Trajectory in Hard-Label Text Attacks via Pivot Words. In Findings of the Association for Computational Linguistics: ACL 2026, pages 3190–3207, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
PivotAttack: Rethinking the Search Trajectory in Hard-Label Text Attacks via Pivot Words (Liang et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.157.pdf
Checklist:
 2026.findings-acl.157.checklist.pdf