ImF: Embedding an Implicit Fingerprint in Your Large Language Models

Jiaxuan Wu, Wanli Peng, Hang fu, Xue Yiming, Juan Wen


Abstract
Training and serving large language models (LLMs) is resource-intensive, making reliable intellectual property (IP) protection and black-box ownership verification increasingly important.Model fingerprinting enables such verification by injecting a small set of secret query–response behaviors, but many existing fingerprints rely on explicit markers or predetermined outputs that are weakly grounded in prompt semantics.This semantic mismatch yields atypical fingerprint responses, reduces stealthiness, and exposes fingerprints to removal by response normalization.We formalize this vulnerability via a new removal attack, Generation Revision Intervention (GRI), which applies system-prompt-level revision and response standardization to steer models toward typical answers, substantially compromising representative injected baselines.To close this semantic gap, we propose the Implicit Fingerprints (ImF): we encode ownership information into a natural-looking target response y via linguistic steganography, then derive a CoT-augmented query x that embeds semantic cues from y to guide the model toward an output sufficiently close to y for decoding-based verification.Experiments on 15 LLMs show that ImF improves stealthiness and remains verifiable under model updates and deployment-time prompt interventions; additional analyses further show stability under common decoding variation and realistic related-model partial merging.
Anthology ID:
2026.acl-long.1183
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
25804–25825
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.1183/
DOI:
Bibkey:
Cite (ACL):
Jiaxuan Wu, Wanli Peng, Hang fu, Xue Yiming, and Juan Wen. 2026. ImF: Embedding an Implicit Fingerprint in Your Large Language Models. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 25804–25825, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
ImF: Embedding an Implicit Fingerprint in Your Large Language Models (Wu et al., ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.acl-long.1183.pdf
Checklist:
 2026.acl-long.1183.checklist.pdf