On the (In-)Security of the Shuffling Defense in the Transformer Secure Inference

Zhengyi Li, Yakai Wang, Jingwen Leng, Kang Yang, Yu Yu, Jiaping Gui, Yu Feng, Ning Liu, Minyi Guo


Abstract
For Transformer models, cryptographically secure inference ensures that the client learns only the final output, while the server learns nothing about the client’s input. However, securely computing nonlinear layers remains a major efficiency bottleneck due to the substantial communication rounds and data transmission required. To address this issue, prior works reveal intermediate activations to the client, allowing nonlinear operations to be computed in plaintext. Although this approach significantly improves efficiency, exposing activations enables adversaries to extract model weights. To mitigate this risk, existing works employ a shuffling defense that reveals only randomly permuted activations to the client. In this work, we show that the shuffling defense is not as robust as previously claimed. We propose an attack that aligns differently shuffled activations to a common permutation and subsequently exploits them to extract model weights. Experiments on Pythia-70m and GPT-2 demonstrate that the proposed attack can align shuffled activations with mean squared errors ranging from 10-9 to 10-6. With a query cost of approximately $1, the adversary can recover model weights with L1-norm differences ranging from 10-4 to 10-2 compared to the oracle weights.
Anthology ID:
2026.acl-long.1341
Volume:
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
29084–29098
Language:
URL:
https://preview.aclanthology.org/check-for-anonymous-pdfs/2026.acl-long.1341/
DOI:
Bibkey:
Cite (ACL):
Zhengyi Li, Yakai Wang, Jingwen Leng, Kang Yang, Yu Yu, Jiaping Gui, Yu Feng, Ning Liu, and Minyi Guo. 2026. On the (In-)Security of the Shuffling Defense in the Transformer Secure Inference. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 29084–29098, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
On the (In-)Security of the Shuffling Defense in the Transformer Secure Inference (Li et al., ACL 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/check-for-anonymous-pdfs/2026.acl-long.1341.pdf
Checklist:
 2026.acl-long.1341.checklist.pdf