Duolin Sun
2026
WebClipper: Efficient Evolution of Web Agents with Graph-based Trajectory Pruning
Junjie Wang | Zequn Xie | Dan Yang | Jie Feng | Yue Shen | Duolin Sun | Meixiu Long | Yihan Jiao | Zhehao Tan | Jian Wang | Peng Wei | Jinjie Gu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Junjie Wang | Zequn Xie | Dan Yang | Jie Feng | Yue Shen | Duolin Sun | Meixiu Long | Yihan Jiao | Zhehao Tan | Jian Wang | Peng Wei | Jinjie Gu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Deep Research systems based on web agents have shown strong potential in solving complex information-seeking tasks, yet their search efficiency remains underexplored. We observe that many state-of-the-art open-source web agents rely on long tool-call trajectories with cyclic reasoning loops and exploration of unproductive branches. To address this, we propose WebClipper, a framework that compresses web agent trajectories via graph-based pruning. Concretely, we model the agent’s search process as a state graph and cast trajectory optimization as a minimum-necessary Directed Acyclic Graph (DAG) mining problem, yielding pruned trajectories that preserve essential reasoning while eliminating redundant steps. Continued training on these refined trajectories enables the agent to evolve toward more efficient search patterns and reduces tool-call rounds by about 20% while improving accuracy. Furthermore, we introduce a new metric called F-AE Score to measure the model’s overall performance in balancing accuracy and efficiency. Experiments demonstrate that WebClipper compresses tool-call rounds under excellent performance, providing practical insight into balancing effectiveness and efficiency in web agent design.
GroupRank: A Groupwise Paradigm for Effective and Efficient Passage Reranking with LLMs
Meixiu Long | Duolin Sun | Dan Yang | Yihan Jiao | Lei Liu | Jiahai Wang | Binbin Hu | Yue Shen | Jie Feng | Zhehao Tan | Junjie Wang | Lianzhen Zhong | Jian Wang | Peng Wei | Jinjie Gu
Findings of the Association for Computational Linguistics: ACL 2026
Meixiu Long | Duolin Sun | Dan Yang | Yihan Jiao | Lei Liu | Jiahai Wang | Binbin Hu | Yue Shen | Jie Feng | Zhehao Tan | Junjie Wang | Lianzhen Zhong | Jian Wang | Peng Wei | Jinjie Gu
Findings of the Association for Computational Linguistics: ACL 2026
Large Language Models (LLMs) have emerged as powerful tools for passage reranking in information retrieval, leveraging their superior reasoning capabilities to address the limitations of conventional models on complex queries. However, current LLM-based reranking paradigms are fundamentally constrained by an efficiency-accuracy trade-off: (1) pointwise methods are efficient but ignore inter-document comparison, yielding suboptimal accuracy; (2) listwise methods capture global context but suffer from context-window constraints and prohibitive inference latency. To address these issues, we propose GroupRank, a novel paradigm that balances flexibility and context awareness. To unlock the full potential of groupwise reranking, we propose an answer-free data synthesis pipeline that fuses local pointwise signals with global listwise rankings. These samples facilitate supervised fine-tuning and reinforcement learning, with the latter guided by a specialized group-ranking reward comprising ranking-utility and group-alignment. These complementary components synergistically optimize document ordering and score calibration to reflect intrinsic query-document relevance.Experimental results show GroupRank achieves a state-of-the-art 65.2 NDCG@10 on BRIGHT and surpasses baselines by 2.1 points on R2MED, while delivering a 6.4× inference speedup. The code is available at https://github.com/AQ-MedAI/Diver/tree/main/Reranker/GroupRank.
2025
HIRAG: Hierarchical-Thought Instruction-Tuning Retrieval-Augmented Generation
Yihan Jiao | Zhehao Tan | Dan Yang | Duolin Sun | Jie Feng | Yue Shen | Jian Wang | Peng Wei
Findings of the Association for Computational Linguistics: EMNLP 2025
Yihan Jiao | Zhehao Tan | Dan Yang | Duolin Sun | Jie Feng | Yue Shen | Jian Wang | Peng Wei
Findings of the Association for Computational Linguistics: EMNLP 2025
Retrieval-augmented generation (RAG) has become a fundamental paradigm for addressing the challenges faced by large language models in handling real-time information and domain-specific problems. Traditional RAG systems primarily rely on the in-context learning (ICL) capabilities of the large language model itself. Still, in-depth research on the specific capabilities needed by the RAG generation model is lacking, leading to challenges with inconsistent document quality and retrieval system imperfections. Even the limited studies that fine-tune RAG generative models often lack a granular focus on RAG tasks or a deeper utilization of chain-of-thought processes. To address this, we propose that RAG models should possess three progressively hierarchical abilities (1) Filtering: the ability to select relevant information; (2) Combination: the ability to combine semantic information across paragraphs; and (3) RAG-specific reasoning: the ability to further process external knowledge using internal knowledge. Thus, we introduce our new RAG instruction fine-tuning method, Hierarchical-Thought Instruction-Tuning Retrieval-Augmented Generation (HIRAG) incorporates a “think before answering” strategy. This method enhances the model’s open-book examination capability by utilizing multi-level progressive chain-of-thought. Experiments show that the HIRAG training strategy significantly improves the model’s performance on datasets such as RGB, PopQA, MuSiQue, HotpotQA, and PubmedQA.