Wei Dai
Other people with similar names: Wei Dai, Wei Dai, Wei Dai, Wei Dai
Unverified author pages with similar names: Wei Dai
2026
The Challenge of Identifying the Origin of Black-Box Large Language Models
Ziqing Yang | Yixin Wu | Yun Shen | Wei Dai | Michael Backes | Yang Zhang
Proceedings of the Seventh Workshop on Privacy in Natural Language Processing
Ziqing Yang | Yixin Wu | Yun Shen | Wei Dai | Michael Backes | Yang Zhang
Proceedings of the Seventh Workshop on Privacy in Natural Language Processing
The tremendous commercial potential of large language models (LLMs) has heightened concerns over their unauthorized use. To address this, we focus on the task of identifying the origin of black-box LLMs. We further propose PlugAE, an effective and efficient identification method that proactively leverages LLM-specific adversarial embeddings and allows users to customize copyright tokens on a targeted query set. Extensive experiments demonstrate that PlugAE outperforms both state-of-the-art model watermarking and fingerprinting methods in accuracy and robustness. We further analyze its stealthiness and reliability from three complementary perspectives and conduct ablation studies under various configurations, confirming its practicality for real-world misuse detection.