SERM: Self-Evolving Relevance Model with Agent-Driven Learning from Massive Query Streams

Chenglong Wang, Canjia Li, Xingzhao Zhu, Yifu Huo, Huiyu Wang, Weixiong Lin, Yun Yang, Qiaozhi He, Tian Hua Zhou, Changxiaojia, JingBo Zhu, Tong Xiao


Abstract
Due to the dynamically evolving nature of real-world query streams, relevance models struggle to generalize to practical search scenarios. A sophisticated solution is self-evolution techniques. However, in large-scale industrial settings with massive query streams, this technique faces two challenges: (1) informative samples are often sparse and difficult to identify, and (2) pseudo-labels generated by the current model could be unreliable. To address these challenges, in this work, we propose a Self-Evolving Relevance Model approach (SERM), which comprises two complementary multi-agent modules: a multi-agent sample miner, designed to detect distributional shifts and identify informative training samples, and a multi-agent relevance annotator, which provides reliable labels through a two-level agreement framework. We evaluated SERM on a large-scale industrial platform, which serves billions of user requests daily. Experimental results demonstrate that SERM can achieve significant performance gains through iterative self-evolution, as validated by extensive offline multilingual evaluations and online testing.
Anthology ID:
2026.findings-acl.823
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
16687–16706
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.823/
DOI:
Bibkey:
Cite (ACL):
Chenglong Wang, Canjia Li, Xingzhao Zhu, Yifu Huo, Huiyu Wang, Weixiong Lin, Yun Yang, Qiaozhi He, Tian Hua Zhou, Changxiaojia, JingBo Zhu, and Tong Xiao. 2026. SERM: Self-Evolving Relevance Model with Agent-Driven Learning from Massive Query Streams. In Findings of the Association for Computational Linguistics: ACL 2026, pages 16687–16706, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
SERM: Self-Evolving Relevance Model with Agent-Driven Learning from Massive Query Streams (Wang et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.823.pdf
Checklist:
 2026.findings-acl.823.checklist.pdf