Your Semantic-Independent Watermark is Fragile: A Semantic Perturbation Attack against EaaS Watermark

Zekun Fei, Biao Yi, Jianing Geng, He Ruiqi, Lihai Nie, Zheli Liu


Abstract
Embedding-as-a-Service (EaaS) has emerged as a successful business pattern but faces significant challenges related to various forms of copyright infringement, particularly API misuse and model extraction attacks. Various studies have proposed backdoor-based watermarking schemes to protect the copyright of EaaS. In this paper, we reveal that previous watermarking schemes possess semantic-independent characteristics and propose the Semantic Perturbation Attack (SPA). Our theoretical and experimental analysis demonstrates that this semantic-independent nature makes current watermarking schemes vulnerable to adaptive attacks that exploit semantic perturbation tests to bypass watermark verification. Extensive experimental results across multiple datasets demonstrate that the True Positive Rate (TPR) for identifying watermarked samples under SPA can exceed 95%, rendering watermarks ineffective while maintaining the high utility of the embeddings. In addition, we discuss potential defense strategies to mitigate SPA. Our code is available at https://github.com/Zk4-ps/EaaS-Embedding-Watermark.
Anthology ID:
2025.findings-emnlp.192
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2025
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
3600–3614
URL:
https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.192/
DOI:
10.18653/v1/2025.findings-emnlp.192
Cite (ACL):
Zekun Fei, Biao Yi, Jianing Geng, He Ruiqi, Lihai Nie, and Zheli Liu. 2025. Your Semantic-Independent Watermark is Fragile: A Semantic Perturbation Attack against EaaS Watermark. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 3600–3614, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
Your Semantic-Independent Watermark is Fragile: A Semantic Perturbation Attack against EaaS Watermark (Fei et al., Findings 2025)
PDF:
https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.192.pdf
Checklist:
 2025.findings-emnlp.192.checklist.pdf