Xinyu Wang
McGill
Other people with similar names: Xinyu Wang (Warwick, King’s College London), Xinyu Wang
Unverified author pages with similar names: Xinyu Wang
2026
MATCH: Modulating Attention via In-Context Retrieval for Long-Context Transformers
Linrui Ma | Chun Hei Lo | Xinyu Wang | Peng Lu | Xihao Yuan | Hanting Chen | Kai Han | Xinghao Chen | Chengjun Zhan | Hanlin xu | Yichun Yin | Lifeng Shang | Feng Wen | Boxing Chen | Yufei Cui
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Linrui Ma | Chun Hei Lo | Xinyu Wang | Peng Lu | Xihao Yuan | Hanting Chen | Kai Han | Xinghao Chen | Chengjun Zhan | Hanlin xu | Yichun Yin | Lifeng Shang | Feng Wen | Boxing Chen | Yufei Cui
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
The quadratic computational cost of traditional attention mechanisms poses a major bottleneck to the scalability and practical deployment of large language models (LLMs), particularly in long-context scenarios. To improve efficiency, existing approaches often enforce rigid structural constraints such as local attention windows. However, these strategies typically lead to substantial performance degradation on tasks requiring precise long-range recall. In this work, we propose MATCH, a scalable and efficient framework that augments sparsified attention mechanisms with dynamically integrated in-context information through an efficient retrieval system. Empirical results show that MATCH significantly improves the performance of sparse-attention models on both synthetic and real-world natural-language tasks. These findings highlight the versatility of MATCH as a general approach for enhancing in-context retrieval capabilities while maintaining the efficiency benefits of sparse attention architectures.
2025
Boosting Data Utilization for Multilingual Dense Retrieval
Chao Huang | Fengran Mo | Yufeng Chen | Changhao Guan | Zhenrui Yue | Xinyu Wang | Jinan Xu | Kaiyu Huang
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Chao Huang | Fengran Mo | Yufeng Chen | Changhao Guan | Zhenrui Yue | Xinyu Wang | Jinan Xu | Kaiyu Huang
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Multilingual dense retrieval aims to retrieve relevant documents across different languages based on a unified retriever model. The challenge lies in aligning representations of different languages in a shared vector space. The common practice is to fine-tune the dense retriever via contrastive learning, whose effectiveness highly relies on the quality of the negative sample and the efficacy of mini-batch data. Different from the existing studies that focus on developing sophisticated model architecture, we propose a method to boost data utilization for multilingual dense retrieval by obtaining high-quality hard negative samples and effective mini-batch data. The extensive experimental results on a multilingual retrieval benchmark, MIRACL, with 16 languages demonstrate the effectiveness of our method by outperforming several existing strong baselines.
Improving Context Fidelity via Native Retrieval-Augmented Reasoning
Suyuchen Wang | Jinlin Wang | Xinyu Wang | Shiqi Li | Xiangru Tang | Sirui Hong | Xiao-Wen Chang | Chenglin Wu | Bang Liu
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Suyuchen Wang | Jinlin Wang | Xinyu Wang | Shiqi Li | Xiangru Tang | Sirui Hong | Xiao-Wen Chang | Chenglin Wu | Bang Liu
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Large language models (LLMs) often struggle with context fidelity, producing inconsistent answers when responding to questions based on provided information. Existing approaches either rely on expensive supervised fine-tuning to generate evidence post-answer or train models to perform web searches without necessarily improving utilization of the given context. We propose CARE, a novel native retrieval-augmented reasoning framework that teaches LLMs to explicitly integrate in-context evidence within their reasoning process with the model’s own retrieval capabilities. Our method requires limited labeled evidence data while significantly enhancing both retrieval accuracy and answer generation performance through strategically retrieved in-context tokens in the reasoning chain. Extensive experiments on multiple real-world and counterfactual QA benchmarks demonstrate that our approach substantially outperforms supervised fine-tuning, traditional retrieval-augmented generation methods, and external retrieval solutions. This work represents a fundamental advancement in making LLMs more accurate, reliable, and efficient for knowledge-intensive tasks.
STRICT: Stress-Test of Rendering Image Containing Text
Tianyu Zhang | Xinyu Wang | Lu Li | Zhenghan Tai | Jijun Chi | Jingrui Tian | Hailin He | Suyuchen Wang
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Tianyu Zhang | Xinyu Wang | Lu Li | Zhenghan Tai | Jijun Chi | Jingrui Tian | Hailin He | Suyuchen Wang
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
While diffusion models have revolutionized text-to-image generation with their ability to synthesize realistic and diverse scenes, they continue to struggle with generating consistent and legible text within images. This shortcoming is commonly attributed to the locality bias inherent in diffusion-based generation, which limits their capacity to model long-range spatial dependencies. In this paper, we introduce STRICT, a benchmark designed to systematically stress-test the ability of diffusion models to render coherent and instruction-aligned text in images. Our benchmark evaluates models across multiple dimensions: (1) the maximum length of readable text that can be generated and (2) the correctness and legibility of the generated text. We assess several state-of-the-art models, including proprietary and open-source variants, and reveal persistent limitations in long-range consistency and instruction-following capabilities. Our findings provide insights into architectural bottlenecks and motivate future research directions in multimodal generative modeling.
Search
Fix author
Co-authors
- Suyuchen Wang 2
- Xiao-Wen Chang 1
- Boxing Chen 1
- Hanting Chen 1
- Xinghao Chen 1
- Yufeng Chen 1
- Jijun Chi 1
- Yufei Cui 1
- Changhao Guan 1
- Kai Han 1
- Hailin He 1
- Sirui Hong 1
- Chao Huang 1
- Kaiyu Huang (黄锴宇) 1
- Lu Li 1
- Shiqi Li 1
- Bang Liu 1
- Chun Hei Lo 1
- Peng Lu 1
- Linrui Ma 1
- Fengran Mo 1
- Lifeng Shang 1
- Zhenghan Tai 1
- Xiangru Tang 1
- Jingrui Tian 1
- Jinlin Wang 1
- Feng Wen 1
- Chenglin Wu 1
- Jinan Xu (徐金安) 1
- Yichun Yin 1
- Xihao Yuan 1
- Zhenrui Yue 1
- Chengjun Zhan 1
- Tianyu Zhang 1
- Hanlin xu 1