Yuchen Li
Other people with similar names: Yuchen Li
Unverified author pages with similar names: Yuchen Li
2026
On the Representation Geometry of LoRA Model Merging
Chenyang Lu | Jiaru Li | Jinman Zhao | Xinran Chen | Yining Wang | Renyi Cai | Yuchen Li | Chao He
Findings of the Association for Computational Linguistics: ACL 2026
Chenyang Lu | Jiaru Li | Jinman Zhao | Xinran Chen | Yining Wang | Renyi Cai | Yuchen Li | Chao He
Findings of the Association for Computational Linguistics: ACL 2026
Low-Rank Adaptation (LoRA) is widely used for parameter-efficient fine-tuning, yet merging multiple task-specific LoRA updates without additional training remains challenging. Most existing LoRA merging methods rely on SVD-based alignment, which emphasizes globally shared structure across tasks. In this work, we show that LoRA merging performance can be further improved by combining SVD with CUR decomposition. Through a representation-level analysis, we find that SVD-based decompositions primarily model shared components across tasks, while CUR-based decompositions better preserve task-specific and localized updates. These two perspectives are geometrically misaligned and exhibit complementary advantages, revealing an inherent trade-off between capturing shared structure and preserving task-specific information in LoRA model merging. Guided by this analysis, we propose a training-free merging procedure that explicitly combines the shared structure captured by SVD with the task-specific components preserved by CUR. Experiments on both vision and language benchmarks demonstrate consistent improvements over existing gradient-free LoRA merging methods.
Agentic-R: Learning to Retrieve for Agentic Search
Wenhan Liu | Xinyu Ma | Yutao Zhu | Yuchen Li | Daiting Shi | Dawei Yin | Zhicheng Dou
Findings of the Association for Computational Linguistics: ACL 2026
Wenhan Liu | Xinyu Ma | Yutao Zhu | Yuchen Li | Daiting Shi | Dawei Yin | Zhicheng Dou
Findings of the Association for Computational Linguistics: ACL 2026
Agentic search has recently emerged as a powerful paradigm, where an agent interleaves multi-step reasoning with on-demand retrieval to solve complex questions. Despite its success, how to design a retriever for agentic search remains largely underexplored. Existing search agents typically rely on similarity-based retrievers, while similar passages are not always useful for final answer generation. In this paper, we propose a novel retriever training framework tailored for agentic search. Unlike retrievers designed for single-turn retrieval-augmented generation (RAG) that only rely on local passage utility, we propose to use both local query-passage relevance and global answer correctness to measure passage utility in a multi-turn agentic search. We further introduce an iterative training strategy, where the search agent and the retriever are optimized bidirectionally and iteratively. Different from RAG retrievers that are only trained once with fixed questions, our retriever is continuously improved using evolving and higher-quality queries from the agent. Extensive experiments on seven single-hop and multi-hop QA benchmarks demonstrate that our retriever, termed Agentic-R, consistently outperforms strong baselines across different search agents.
Dr. Assistant: Enhancing Clinical Diagnostic Inquiry via Structured Diagnostic Reasoning Data and Reinforcement Learning
Yue Guo | Fanfu Wang | Jianwei Lv | Xincheng Shi | Yuchen Li | Youya Wang | Yunsheng Zeng | Yujing Liu | Yunhao Qiao | Gen Li | Junfeng Wang | Bo Yuan
Findings of the Association for Computational Linguistics: ACL 2026
Yue Guo | Fanfu Wang | Jianwei Lv | Xincheng Shi | Yuchen Li | Youya Wang | Yunsheng Zeng | Yujing Liu | Yunhao Qiao | Gen Li | Junfeng Wang | Bo Yuan
Findings of the Association for Computational Linguistics: ACL 2026
Clinical Decision Support Systems (CDSSs) provide reasoning and inquiry guidance for physicians, yet they face notable challenges, including high maintenance costs and low generalization capability.Recently, Large Language Models (LLMs) have been widely adopted in healthcare due to their extensive knowledge reserves, retrieval, and communication capabilities. While LLMs show promise and excel at medical benchmarks, their diagnostic reasoning and inquiry skills are constrained.To mitigate this issue, we propose (1) Clinical Diagnostic Reasoning Data (CDRD) structure to capture abstract clinical reasoning logic, and a pipeline for its construction, and (2) the Dr. Assistant, a clinical diagnostic model equipped with clinical reasoning and inquiry skills. Its training involves a two-stage process: SFT, followed by RL with a tailored reward function.We also introduce a benchmark to evaluate both diagnostic reasoning and inquiry.Our experiments demonstrate that the Dr. Assistant outperforms open-source models and achieves competitive performance to closed-source models, providing an effective solution for clinical diagnostic inquiry guidance. Project information can be found at: https://github.com/YGswu/Dr.-Assistant.
ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability
Wenhan Liu | Xinyu Ma | Weiwei Sun | Yutao Zhu | Yuchen Li | Dawei Yin | Zhicheng Dou
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Wenhan Liu | Xinyu Ma | Weiwei Sun | Yutao Zhu | Yuchen Li | Dawei Yin | Zhicheng Dou
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Large Language Model (LLM) based listwise ranking has shown superior performance in many passage ranking tasks. With the development of Large Reasoning Models (LRMs), many studies have demonstrated that step-by-step reasoning during test-time helps improve listwise ranking performance. However, due to the scarcity of reasoning-intensive training data, existing rerankers perform poorly in many complex ranking scenarios, and the ranking ability of reasoning-intensive rerankers remains largely underdeveloped. In this paper, we first propose an automated reasoning-intensive training data synthesis framework, which sources training queries and passages from diverse domains and applies DeepSeek-R1 to generate high-quality training labels. To empower the listwise reranker with strong reasoning ability, we further propose a two-stage training approach, which includes a cold-start supervised fine-tuning (SFT) stage and a reinforcement learning (RL) stage. During the RL stage, we design a novel multi-view ranking reward tailored to the multi-turn nature of listwise ranking. Extensive experiments demonstrate that our trained reasoning-intensive reranker ReasonRank outperforms existing baselines significantly and also achieves much lower latency than the pointwise reranker.
Logic Jailbreak: Efficiently Unlocking LLM Safety Restrictions Through Formal Logical Expression
Jingyu Peng | Maolin Wang | Nan Wang | Jiatong Li | Yuchen Li | Yuyang Ye | Wanyu Wang | Pengyue Jia | Kai Zhang | Xiangyu Zhao
Findings of the Association for Computational Linguistics: ACL 2026
Jingyu Peng | Maolin Wang | Nan Wang | Jiatong Li | Yuchen Li | Yuyang Ye | Wanyu Wang | Pengyue Jia | Kai Zhang | Xiangyu Zhao
Findings of the Association for Computational Linguistics: ACL 2026
Despite substantial advancements in aligning LLMs with human values, current safety mechanisms remain susceptible to jailbreak attacks. We attribute this vulnerability to the distributional discrepancies between alignment-oriented prompts and malicious prompts. To investigate this, and drawing inspiration from logic-driven NLP tasks, we introduce LogiBreak, a universal black-box jailbreak method that utilizes logical expression translation to bypass LLM safety mechanisms. By converting harmful natural language prompts into formal logical expressions, LogiBreak exploits the distributional gap between alignment data and logic-expressed inputs, preserving the underlying semantic intent and readability while evading safety constraints. Furthermore, to fill the gap of existing benchmarks that lack systematic resources specifically targeting logical expression-based attacks against LLM robustness, we construct a novel multilingual logical expression jailbreak dataset for evaluation. Our evaluations of LogiBreak in five languages demonstrate its effectiveness and generalizability in various linguistic contexts. The code is available at https://github.com/Applied-Machine-Learning-Lab/ACL2026_Logibreak.
Adversarial Yet Cooperative: Multi-Perspective Reasoning in Retrieved-Augmented Language Models
Can Xu | Lingyong Yan | Jiayi Wu | Haosen Wang | Shuaiqiang Wang | Yuchen Li | Jizhou Huang | Dawei Yin | Xiang Li
Findings of the Association for Computational Linguistics: ACL 2026
Can Xu | Lingyong Yan | Jiayi Wu | Haosen Wang | Shuaiqiang Wang | Yuchen Li | Jizhou Huang | Dawei Yin | Xiang Li
Findings of the Association for Computational Linguistics: ACL 2026
Recent advances in synergizing large reasoning models (LRMs) with retrieval-augmented generation (RAG) have shown promising results, yet two critical challenges remain: (1) reasoning models typically operate from a single, unchallenged perspective, limiting their ability to conduct deep, self-correcting reasoning over external documents, and (2) existing training paradigms rely excessively on outcome-oriented rewards, which provide insufficient signal for shaping the complex, multi-step reasoning process. To address these issues, we propose an Reasoner-Verifier framework named Adversarial Reasoning RAG (ARR). The Reasoner and Verifier engage in reasoning on retrieved evidence and critiquing each other’s logic while being guided by process-aware advantage that requires no external scoring model. This reward combines explicit observational signals with internal model uncertainty to jointly optimize reasoning fidelity and verification rigor. Experiments on multiple benchmarks demonstrate the effectiveness of our method. Our code is available at [link](https://github.com/lakhfskn/anonymous-code-of-arr).
UniCreative: Unifying Long-form Logic and Short-form Sparkle via Reference-Free Reinforcement Learning
Xiaolong Wei | Zerun Zhu | Simin Niu | Xingyu Zhang | Peiying Yu | Changxuan Xiao | Yuchen Li | Jicheng Yang | Zhejun Zhao | Chong Meng | Long Xia | Daiting Shi
Findings of the Association for Computational Linguistics: ACL 2026
Xiaolong Wei | Zerun Zhu | Simin Niu | Xingyu Zhang | Peiying Yu | Changxuan Xiao | Yuchen Li | Jicheng Yang | Zhejun Zhao | Chong Meng | Long Xia | Daiting Shi
Findings of the Association for Computational Linguistics: ACL 2026
A fundamental challenge in creative writing lies in reconciling the inherent tension between maintaining global coherence in long-form narratives and preserving local expressiveness in short-form texts. While long-context generation necessitates explicit macroscopic planning, short-form creativity often demands spontaneous, constraint-free expression. Existing alignment paradigms, however, typically employ static reward signals and rely heavily on high-quality supervised data, which is costly and difficult to scale. To address this, we propose UniCreative, a unified reference-free reinforcement learning framework. We first introduce AC-GenRM, an adaptive constraint-aware reward model that dynamically synthesizes query-specific criteria to provide fine-grained preference judgments. Leveraging these signals, we propose ACPO, a policy optimization algorithm that aligns models with human preferences across both content quality and structural paradigms without supervised fine-tuning and ground-truth references. Empirical results demonstrate that AC-GenRM aligns closely with expert evaluations, while ACPO significantly enhances performance across diverse writing tasks. Crucially, our analysis reveals an emergent meta-cognitive ability: the model learns to autonomously differentiate between tasks requiring rigorous planning and those favoring direct generation, validating the effectiveness of our direct alignment approach.
2025
Enhancing Retrieval-Augmented Generation via Evidence Tree Search
Hao Sun | Hengyi Cai | Yuchen Li | Xuanbo Fan | Xiaochi Wei | Shuaiqiang Wang | Yan Zhang | Dawei Yin
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Hao Sun | Hengyi Cai | Yuchen Li | Xuanbo Fan | Xiaochi Wei | Shuaiqiang Wang | Yan Zhang | Dawei Yin
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Retrieval-Augmented Generation (RAG) is widely used to enhance Large Language Models (LLMs) by grounding responses in external knowledge. However, in real-world applications, retrievers often return lengthy documents with redundant or irrelevant content, confusing downstream readers. While evidence retrieval aims to address this by extracting key information, it faces critical challenges: (1) inability to model synergistic inter-dependencies among evidence sentences, (2) lack of supervision for evaluating multi-sentence evidence quality, and (3) computational inefficiency in navigating exponentially growing search spaces of candidate evidence sets. To tackle these challenges, we propose ETS (Evidence Tree Search), a novel framework that reformulates evidence retrieval as a dynamic tree expansion process. Our approach first constructs an evidence tree where each path represents a candidate evidence set, explicitly modeling inter-sentence dependencies through context-aware node selection. We then leverage Monte Carlo Tree Search (MCTS) to efficiently assess evidence quality and introduce an Early-Terminating Beam Search strategy to efficiently accelerate the model inference. Extensive experiments on five datasets demonstrate that ETS significantly outperforms existing methods across different readers. Our code and datasets will be released to facilitate future research.
Search
Fix author
Co-authors
- Dawei Yin 4
- Zhicheng Dou (窦志成) 2
- Wenhan Liu 2
- Xinyu Ma 2
- Daiting Shi 2
- Shuaiqiang Wang 2
- Yutao Zhu (朱余韬) 2
- Renyi Cai 1
- Hengyi Cai 1
- Xinran Chen 1
- Xuanbo Fan 1
- Yue Guo 1
- Chao He 1
- Jizhou Huang 1
- Pengyue Jia 1
- Jiaru Li 1
- Gen Li 1
- Jiatong Li 1
- Xiang Li 1
- Yujing Liu 1
- Chenyang Lu 1
- Jianwei Lv 1
- Chong Meng 1
- Simin Niu 1
- Jingyu Peng 1
- Yunhao Qiao 1
- Xincheng Shi 1
- Weiwei Sun 1
- Hao Sun 1
- Yining Wang 1
- Fanfu Wang 1
- Youya Wang 1
- Junfeng Wang 1
- Maolin Wang 1
- Nan Wang 1
- Wanyu Wang 1
- Haosen Wang 1
- Xiaolong Wei 1
- Xiaochi Wei 1
- Jiayi Wu 1
- Long Xia 1
- Changxuan Xiao 1
- Can Xu 1
- Lingyong Yan 1
- Jicheng Yang 1
- Yuyang Ye 1
- Peiying Yu 1
- Bo Yuan 1
- Yunsheng Zeng 1
- Kai Zhang 1
- Xingyu Zhang 1
- Yan Zhang 1
- Jinman Zhao 1
- Xiangyu Zhao 1
- Zhejun Zhao 1
- Zerun Zhu 1