OSCR-Attack: One-Shot Character Level Attacks through Self-Optimizing Continuous Relaxation

Lingyi Kong, Zhuo Liu, Zhanghao Hu, Qilong Qiu, Yutao Yang, Jingjing Xue, Zheng Wang, Lin Gui, Feiping Nie


Abstract
Adversarial attacks have attracted growing attention across domains, including natural language processing (NLP). Character-level adversarial attacks preserve semantics, but they have received less attention because the discrete operations they use are costly and inefficient. Challenging these beliefs, we introduce two adaptively learnable matrices that transform discrete choices into continuous representations, enabling automatic one-shot multi-position, multi-character insertion. To optimize the two learnable matrices, we propose OSCR-Attack, an end-to-end framework based on gradient-based optimization, with a conflict resolution strategy that maps the optimized continuous distributions back into discrete insertion operations. Extensive experiments on three benchmarks with three open-source large language models (LLMs) show that OSCR-Attack improves attack success rate (ASR) by up to 21.45% points and accelerates the attack by up to 3.66 times compared to recent baselines.
Anthology ID:
2026.findings-acl.1235
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
24668–24683
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1235/
DOI:
Bibkey:
Cite (ACL):
Lingyi Kong, Zhuo Liu, Zhanghao Hu, Qilong Qiu, Yutao Yang, Jingjing Xue, Zheng Wang, Lin Gui, and Feiping Nie. 2026. OSCR-Attack: One-Shot Character Level Attacks through Self-Optimizing Continuous Relaxation. In Findings of the Association for Computational Linguistics: ACL 2026, pages 24668–24683, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
OSCR-Attack: One-Shot Character Level Attacks through Self-Optimizing Continuous Relaxation (Kong et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1235.pdf
Checklist:
 2026.findings-acl.1235.checklist.pdf