Benchmarking Web Agent Safety under E-commerce Deceptive Interfaces

Zijing Shi; Meng Fang; Ling Chen

Benchmarking Web Agent Safety under E-commerce Deceptive Interfaces

Abstract

As autonomous web agents are increasingly deployed to perform real-world tasks, ensuring their safety has become a critical concern. In this work, we study web agent behavior under realistic deceptive interfaces in the e-commerce domain. We introduce WebDecept, a lightweight and configurable plugin framework that enables controlled injection of deceptive interface patterns into existing web environments. Using WebDecept, we instantiate seven deceptive patterns commonly observed on the open web, including targeted advertisements, domain redirection, and shopping manipulation. By injecting these patterns into the frontend during task execution, we perform controlled evaluation of multiple multimodal web agents. Our results show that current web agents are highly susceptible to multiple classes of deceptive interfaces, and that prompt-based constraints are often insufficient to mitigate these failures. We further analyze how the design choices of deceptive patterns influence the success of such manipulations. These findings highlight safety challenges that should be addressed as web agents are scaled toward real-world deployment.

Anthology ID:: 2026.acl-long.1009
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 22090–22103
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.1009/
DOI:
Bibkey:
Cite (ACL):: Zijing Shi, Meng Fang, and Ling Chen. 2026. Benchmarking Web Agent Safety under E-commerce Deceptive Interfaces. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 22090–22103, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Benchmarking Web Agent Safety under E-commerce Deceptive Interfaces (Shi et al., ACL 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.acl-long.1009.pdf
Checklist:: 2026.acl-long.1009.checklist.pdf

PDF Cite Search Checklist Fix data