Jingbo Sun
Other people with similar names: Jingbo Sun
Unverified author pages with similar names: Jingbo Sun
2026
Beyond Query Memorization: Large Language Model Routing with Query Decomposition and Historical Matching
Bo Lv | Jingbo Sun | Jianwei Lv | Chen Tang | Shaojie Zhang | Nayu Liu | Guoxin Yu | Zihao Li | Qichao Zhang | Dongbin Zhao | Ping Luo | Yue Yu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Bo Lv | Jingbo Sun | Jianwei Lv | Chen Tang | Shaojie Zhang | Nayu Liu | Guoxin Yu | Zihao Li | Qichao Zhang | Dongbin Zhao | Ping Luo | Yue Yu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Optimizing the trade-off among predictive performance and computational cost is a central focus in the deployment of Large Language Models (LLMs). Current routing methods primarily rely on direct mapping from queries to models based on surface-level features, making them susceptible to the memorization trap and leading to poor generalizability on out-of-distribution (OOD) data. In this paper, we propose DecoR, a novel routing framework that recasts the routing task as a matching process of sifting similar queries from historical logs, effectively mitigating the memorization trap. To enhance matching accuracy, we introduce a query capability deconstruction method that decouples linguistic surface forms from task-intrinsic requirements, directing matching toward capability dimensions to ground decisions in essential task attributes. Furthermore, we develop CodaSet, a comprehensive benchmark for assessing routing generalization, where experimental results demonstrate that DecoR maintains superior accuracy while substantially lowering inference costs across both in-distribution and OOD settings.