PivotAttack: Rethinking the Search Trajectory in Hard-Label Text Attacks via Pivot Words
Yuzhi Liang, Shiliang Xiao, Jingsong Wei, Qiliang Lin, Xia Li
Abstract
Existing hard-label text attacks often rely on inefficient "outside-in" strategies that traverse vast search spaces. We propose PivotAttack, a query-efficient "inside-out" framework. It employs a Multi-Armed Bandit algorithm to identify Pivot Sets—combinatorial token groups acting as prediction anchors—and strategically perturbs them to induce label flips. This approach captures inter-word dependencies and minimizes query costs. Extensive experiments across traditional models and Large Language Models demonstrate that PivotAttack consistently outperforms state-of-the-art baselines in both Attack Success Rate and query efficiency.- Anthology ID:
- 2026.findings-acl.157
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 3190–3207
- Language:
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.157/
- DOI:
- Cite (ACL):
- Yuzhi Liang, Shiliang Xiao, Jingsong Wei, Qiliang Lin, and Xia Li. 2026. PivotAttack: Rethinking the Search Trajectory in Hard-Label Text Attacks via Pivot Words. In Findings of the Association for Computational Linguistics: ACL 2026, pages 3190–3207, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- PivotAttack: Rethinking the Search Trajectory in Hard-Label Text Attacks via Pivot Words (Liang et al., Findings 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.157.pdf