Don’t Click That: Teaching Web Agents to Resist Deceptive Interfaces

Yilin Zhang, Yingkai Hua, Chunyu Wei, Xin Wang, Yueguo Chen


Abstract
Vision-language model (VLM) based web agents demonstrate impressive autonomous GUI interaction but remain vulnerable to deceptive interface elements. Existing approaches either detect deception without task integration or document attacks without proposing defenses. We formalize deception-aware web agent defense and propose DUDE (Deceptive UI Detector Evaluator), a two-stage framework combining hybrid-reward learning with asymmetric penalties and experience summarization to distill failure patterns into transferable guidance. We introduce RUC (Real UI Clickboxes), a benchmark of 1,407 scenarios spanning four domains and deception categories. Experiments show DUDE reduces deception susceptibility by 53.8% while maintaining task performance, establishing an effective foundation for robust web agent deployment.
Anthology ID:
2026.acl-long.310
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
6830–6852
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.310/
DOI:
Bibkey:
Cite (ACL):
Yilin Zhang, Yingkai Hua, Chunyu Wei, Xin Wang, and Yueguo Chen. 2026. Don’t Click That: Teaching Web Agents to Resist Deceptive Interfaces. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 6830–6852, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Don’t Click That: Teaching Web Agents to Resist Deceptive Interfaces (Zhang et al., ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.310.pdf
Checklist:
 2026.acl-long.310.checklist.pdf