Xinyu Wang
McGill
Other people with similar names: Xinyu Wang (Warwick, King’s College London), Xinyu Wang, Xinyu Wang, Xinyu Wang
Unverified author pages with similar names: Xinyu Wang
2026
EvoEdit: Evolving Null-space Alignment for Robust and Efficient Knowledge Editing
Sicheng Lyu | Yu Gu | Xinyu Wang | Jerry Huang | Sitao Luan | Yufei Cui | Xiao-Wen Chang | Peng Lu
Findings of the Association for Computational Linguistics: ACL 2026
Sicheng Lyu | Yu Gu | Xinyu Wang | Jerry Huang | Sitao Luan | Yufei Cui | Xiao-Wen Chang | Peng Lu
Findings of the Association for Computational Linguistics: ACL 2026
Large language models (LLMs) require continual updates to rectify outdated or erroneous knowledge. Model editing has emerged as a compelling paradigm for introducing targeted modifications without the computational burden of full retraining. Existing approaches are mainly based on a locate-then-edit framework. However, in sequential editing contexts, where multiple updates are applied over time, they exhibit significant limitations and suffer from catastrophic interference, i.e., new edits compromise previously integrated updates and degrade preserved knowledge. To address these challenges, we introduce EvoEdit, a novel editing strategy that mitigates catastrophic interference through sequential null-space alignment, enabling stable and efficient model editing. By performing sequential null-space alignment for each incoming edit, EvoEdit preserves both original and previously modified knowledge representations and maintains output invariance on preserved knowledge even across long edit sequences, effectively mitigating interference. Evaluations on real-world sequential knowledge-editing benchmarks show that EvoEdit achieves better or comparable performance than prior state-of-the-art locate-then-edit techniques, with up to 3.53× speedup. Overall, these results underscore the necessity of developing more principled approaches for designing LLMs in dynamically evolving information settings, while providing a simple yet effective solution with strong theoretical guarantees.
MATCH: Modulating Attention via In-Context Retrieval for Long-Context Transformers
Linrui Ma | Chun Hei Lo | Xinyu Wang | Peng Lu | Xihao Yuan | Hanting Chen | Kai Han | Xinghao Chen | Chengjun Zhan | Hanlin xu | Yichun Yin | Lifeng Shang | Feng Wen | Boxing Chen | Yufei Cui
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Linrui Ma | Chun Hei Lo | Xinyu Wang | Peng Lu | Xihao Yuan | Hanting Chen | Kai Han | Xinghao Chen | Chengjun Zhan | Hanlin xu | Yichun Yin | Lifeng Shang | Feng Wen | Boxing Chen | Yufei Cui
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
The quadratic computational cost of traditional attention mechanisms poses a major bottleneck to the scalability and practical deployment of large language models (LLMs), particularly in long-context scenarios. To improve efficiency, existing approaches often enforce rigid structural constraints such as local attention windows. However, these strategies typically lead to substantial performance degradation on tasks requiring precise long-range recall. In this work, we propose MATCH, a scalable and efficient framework that augments sparsified attention mechanisms with dynamically integrated in-context information through an efficient retrieval system. Empirical results show that MATCH significantly improves the performance of sparse-attention models on both synthetic and real-world natural-language tasks. These findings highlight the versatility of MATCH as a general approach for enhancing in-context retrieval capabilities while maintaining the efficiency benefits of sparse attention architectures.
MARS: Unleashing the Power of Speculative Decoding via Margin-Aware Verification
Jingwei Song | Xinyu Wang | Hanbin Wang | Xiaoxuan Lei | Tianyu Shi | Shixin Han | Eric Yang | Xiao-Wen Chang | Lynn Ai
Findings of the Association for Computational Linguistics: ACL 2026
Jingwei Song | Xinyu Wang | Hanbin Wang | Xiaoxuan Lei | Tianyu Shi | Shixin Han | Eric Yang | Xiao-Wen Chang | Lynn Ai
Findings of the Association for Computational Linguistics: ACL 2026
Speculative Decoding (SD) accelerates autoregressive large language model (LLM) inference by decoupling generation and verification. While recent methods improve draft quality by tightly coupling the drafter with the target model, the verification mechanism itself remains largely unchanged, relying on strict token-level rejection sampling. In practice, modern LLMs frequently operate in low-margin regimes where the target model exhibits weak preference among top candidates. In such cases, rejecting plausible runner-up tokens yields negligible information gain while incurring substantial rollback cost, leading to a fundamental inefficiency in verification.We propose Margin-Aware Speculative Verification, a training-free and domain-agnostic verification strategy that adapts to the target model’s local decisiveness. Our method conditions verification on decision stability measured directly from the target logits and relaxes rejection only when strict verification provides minimal benefit. Importantly, the approach modifies only the verification rule and is fully compatible with existing target-coupled speculative decoding frameworks. Extensive experiments across model scales ranging from 8B to 235B demonstrate that our method delivers consistent and significant inference speedups over state-of-the-art baselines while preserving generation quality across diverse benchmarks. The code is available at https://github.com/5SSjw/MARS.
2025
Boosting Data Utilization for Multilingual Dense Retrieval
Chao Huang | Fengran Mo | Yufeng Chen | Changhao Guan | Zhenrui Yue | Xinyu Wang | Jinan Xu | Kaiyu Huang
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Chao Huang | Fengran Mo | Yufeng Chen | Changhao Guan | Zhenrui Yue | Xinyu Wang | Jinan Xu | Kaiyu Huang
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Multilingual dense retrieval aims to retrieve relevant documents across different languages based on a unified retriever model. The challenge lies in aligning representations of different languages in a shared vector space. The common practice is to fine-tune the dense retriever via contrastive learning, whose effectiveness highly relies on the quality of the negative sample and the efficacy of mini-batch data. Different from the existing studies that focus on developing sophisticated model architecture, we propose a method to boost data utilization for multilingual dense retrieval by obtaining high-quality hard negative samples and effective mini-batch data. The extensive experimental results on a multilingual retrieval benchmark, MIRACL, with 16 languages demonstrate the effectiveness of our method by outperforming several existing strong baselines.
Improving Context Fidelity via Native Retrieval-Augmented Reasoning
Suyuchen Wang | Jinlin Wang | Xinyu Wang | Shiqi Li | Xiangru Tang | Sirui Hong | Xiao-Wen Chang | Chenglin Wu | Bang Liu
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Suyuchen Wang | Jinlin Wang | Xinyu Wang | Shiqi Li | Xiangru Tang | Sirui Hong | Xiao-Wen Chang | Chenglin Wu | Bang Liu
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Large language models (LLMs) often struggle with context fidelity, producing inconsistent answers when responding to questions based on provided information. Existing approaches either rely on expensive supervised fine-tuning to generate evidence post-answer or train models to perform web searches without necessarily improving utilization of the given context. We propose CARE, a novel native retrieval-augmented reasoning framework that teaches LLMs to explicitly integrate in-context evidence within their reasoning process with the model’s own retrieval capabilities. Our method requires limited labeled evidence data while significantly enhancing both retrieval accuracy and answer generation performance through strategically retrieved in-context tokens in the reasoning chain. Extensive experiments on multiple real-world and counterfactual QA benchmarks demonstrate that our approach substantially outperforms supervised fine-tuning, traditional retrieval-augmented generation methods, and external retrieval solutions. This work represents a fundamental advancement in making LLMs more accurate, reliable, and efficient for knowledge-intensive tasks.
STRICT: Stress-Test of Rendering Image Containing Text
Tianyu Zhang | Xinyu Wang | Lu Li | Zhenghan Tai | Jijun Chi | Jingrui Tian | Hailin He | Suyuchen Wang
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Tianyu Zhang | Xinyu Wang | Lu Li | Zhenghan Tai | Jijun Chi | Jingrui Tian | Hailin He | Suyuchen Wang
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
While diffusion models have revolutionized text-to-image generation with their ability to synthesize realistic and diverse scenes, they continue to struggle with generating consistent and legible text within images. This shortcoming is commonly attributed to the locality bias inherent in diffusion-based generation, which limits their capacity to model long-range spatial dependencies. In this paper, we introduce STRICT, a benchmark designed to systematically stress-test the ability of diffusion models to render coherent and instruction-aligned text in images. Our benchmark evaluates models across multiple dimensions: (1) the maximum length of readable text that can be generated and (2) the correctness and legibility of the generated text. We assess several state-of-the-art models, including proprietary and open-source variants, and reveal persistent limitations in long-range consistency and instruction-following capabilities. Our findings provide insights into architectural bottlenecks and motivate future research directions in multimodal generative modeling.
Search
Fix author
Co-authors
- Xiao-Wen Chang 3
- Yufei Cui 2
- Peng Lu 2
- Suyuchen Wang 2
- Lynn Ai 1
- Yufeng Chen 1
- Hanting Chen 1
- Xinghao Chen 1
- Boxing Chen 1
- Jijun Chi 1
- Yu Gu (谷峪) 1
- Changhao Guan 1
- Kai Han 1
- Shixin Han 1
- Hailin He 1
- Sirui Hong 1
- Chao Huang 1
- Kaiyu Huang (黄锴宇) 1
- Jerry Huang 1
- Xiaoxuan Lei 1
- Shiqi Li 1
- Lu Li 1
- Bang Liu 1
- Chun Hei Lo 1
- Sitao Luan 1
- Sicheng Lyu 1
- Linrui Ma 1
- Fengran Mo 1
- Lifeng Shang 1
- Tianyu Shi 1
- Jingwei Song 1
- Zhenghan Tai 1
- Xiangru Tang 1
- Jingrui Tian 1
- Jinlin Wang 1
- Hanbin Wang 1
- Feng Wen 1
- Chenglin Wu 1
- Jinan Xu (徐金安) 1
- Eric Yang 1
- Yichun Yin 1
- Xihao Yuan 1
- Zhenrui Yue 1
- Chengjun Zhan 1
- Tianyu Zhang 1
- Hanlin xu 1