@inproceedings{li-etal-2025-graphkv,
    title = "{G}raph{KV}: Breaking the Static Selection Paradigm with Graph-Based {KV} Cache Eviction",
    author = "Li, Xuelin and
      Jin, Xiangqi and
      Zhang, Linfeng",
    editor = "Christodoulopoulos, Christos and
      Chakraborty, Tanmoy and
      Rose, Carolyn and
      Peng, Violet",
    booktitle = "Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing",
    month = nov,
    year = "2025",
    address = "Suzhou, China",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2025.emnlp-main.1112/",
    doi = "10.18653/v1/2025.emnlp-main.1112",
    pages = "21910--21920",
    isbn = "979-8-89176-332-6",
    abstract = "Efficient Key-Value (KV) cache management is essential for processing long text sequences in large language models (LLMs), where memory constraints often limit performance. Conventional KV eviction strategies, such as top-k selection based on attention scores, depend on static heuristics that fail to capture the evolving implicit dependencies among tokens during inference. To overcome this, we propose GraphKV, a graph-based framework that redefines token selection for KV cache compression. In GraphKV, tokens are modeled as nodes with importance scores, and edges represent their similarity relationships. Through a decay-signal-propagation mechanism, token importance is dynamically updated by propagating information across the graph, enabling adaptive retention of the most contextually significant tokens. GraphKV can be seamlessly utilized in existing KV cache eviction methods such as SnapKV and PyramidKV in a plug-and-play manner. Codes are available in the supplementary materials and will be released on Github."
}
Markdown (Informal)
[GraphKV: Breaking the Static Selection Paradigm with Graph-Based KV Cache Eviction](https://aclanthology.org/2025.emnlp-main.1112/) (Li et al., EMNLP 2025)
ACL