Adaptive Zooming via Relevance-Informed Positional Resource Allocation for Training-free LLM Context Extension

Hongbo Zhao; Huibin Wang; Bin Tang; Xianming Hu; Yihong Huang; Yijun Shen; Nuoyi Chen; Ping Li; Kai Zhang

Adaptive Zooming via Relevance-Informed Positional Resource Allocation for Training-free LLM Context Extension

Hongbo Zhao, Huibin Wang, Bin Tang, Xianming Hu, Yihong Huang, Yijun Shen, Nuoyi Chen, Ping Li, Kai Zhang

Abstract

Large Language Models exhibit degraded performance when extrapolating beyond training context lengths. Existing training-free methods like positional reuse or interpolation can alleviate this issue in an efficient manner. However, these strategies are semantics-agnostic by only considering relative token distances, which could indiscriminately blur semantically relevant and irrelevant tokens alike.To address this, we introduce an adaptive positional zooming method called **Relevance-Informed Positional Resource Allocation (RiPRA)**. RiPRA formulates positional encoding as a constrained resource allocation, in which a fixed positional budget is distributed across tokens in a longer context based on their semantic relevance to the query: relevant tokens get higher positional resolution, while irrelevant tokens (positions) are compressed. By doing this, RiPRA enables a dynamic and nonparametric positional zooming where the positional resolution is adaptively modulated across queries and network layers, effectively improving long-range context modeling and retrieval capacity. Besides, an isotonic smoothing is used to further enforce a global linear ordering relationship to preserve stability and generalization, together with a chunk-based hierarchical approximation to further reduce inference overhead. Extensive experiments across comprehensive benchmarks including LongBench, L-Eval, Passkey Retrieval, and PG19 demonstrate that RiPRA consistently outperforms existing training-free extrapolation methods, showing the value of relevance-conditioned positional encoding for long-context generalization.

Anthology ID:: 2026.findings-acl.1058
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 21067–21083
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1058/
DOI:
Bibkey:
Cite (ACL):: Hongbo Zhao, Huibin Wang, Bin Tang, Xianming Hu, Yihong Huang, Yijun Shen, Nuoyi Chen, Ping Li, and Kai Zhang. 2026. Adaptive Zooming via Relevance-Informed Positional Resource Allocation for Training-free LLM Context Extension. In Findings of the Association for Computational Linguistics: ACL 2026, pages 21067–21083, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Adaptive Zooming via Relevance-Informed Positional Resource Allocation for Training-free LLM Context Extension (Zhao et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1058.pdf
Checklist:: 2026.findings-acl.1058.checklist.pdf

PDF Cite Search Checklist Fix data