Jiaming Liu
Other people with similar names: Jiaming Liu
Unverified author pages with similar names: Jiaming Liu
2026
Hyperion: Private Token Sampling with Homomorphic Encryption
Lawrence Lim | Jiaming Liu | Vikas Kalagi | Divyakant Agrawal | Amr El Abbadi
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Lawrence Lim | Jiaming Liu | Vikas Kalagi | Divyakant Agrawal | Amr El Abbadi
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
A promising direction for enabling private queries to large language models (LLMs) is with homomorphic encryption (HE). An open problem is performing token sampling under HE. In this paper, we introduce Hyperion, an efficient HE algorithm for inverse transform sampling, enabling private token sampling with 1 comparison depth, O(1) amortized comparisons, and O(log n) rotations. We implement our approach and demonstrate that it samples tokens in 0.14 seconds for 32k tokens (≈ 4.4\ 𝜇 s per token) on GPU, achieving a 100× latency improvement over prior work.