Xu Han
Other people with similar names: Xu Han, Xu Han, Xu Han
Unverified author pages with similar names: Xu Han
2026
StateX: Enhancing RNN Recall via Post-training State Expansion
Xingyu Shen | Yingfa Chen | Zhen Leng Thai | Xu Han | Zhiyuan Liu | Maosong Sun
Findings of the Association for Computational Linguistics: ACL 2026
Xingyu Shen | Yingfa Chen | Zhen Leng Thai | Xu Han | Zhiyuan Liu | Maosong Sun
Findings of the Association for Computational Linguistics: ACL 2026
Recurrent neural networks (RNNs), such as linear attention and state-space models, have gained popularity due to their constant per-token complexity when processing long contexts. However, these recurrent models struggle with tasks that require accurate recall of contextual information from long contexts, because all contextual information is compressed into a fixed-size recurrent state. Previous studies have shown that recall ability is positively correlated with the recurrent state size, yet directly training RNNs with large recurrent states results in high training costs. In this paper, we introduce StateX, a post-training framework that efficiently expands the states of pre-trained RNNs. For two popular classes of RNNs, linear attention and state-space models, we design post-training architectural modifications in StateX, to scale up the state size with no or negligible increase in model parameters. Experiments on models with up to 1.3B parameters demonstrate that StateX efficiently enhances the recall and in-context learning performance of RNNs without incurring high post-training costs or compromising other capabilities.
AutoReproduce: Automatic AI Experiment Reproduction with Paper Lineage
Xuanle Zhao | Zilin Sang | Yuxuan Li | Qi Shi | Weilun Zhao | Shuo Wang | Duzhen Zhang | Xu Han | Zhiyuan Liu | Maosong Sun
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Xuanle Zhao | Zilin Sang | Yuxuan Li | Qi Shi | Weilun Zhao | Shuo Wang | Duzhen Zhang | Xu Han | Zhiyuan Liu | Maosong Sun
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Efficient reproduction of research papers is pivotal to accelerating scientific progress. However, the increasing complexity of proposed methods often renders reproduction a labor-intensive endeavor, necessitating profound domain expertise.To address this, we introduce the paper lineage, which systematically mines implicit knowledge from the cited literature. This algorithm serves as the backbone of our proposed , a multi-agent framework designed to autonomously reproduce experimental code in a complete, end-to-end manner. To ensure code executability, incorporates a sampling-based unit testing strategy for rapid validation. To assess reproduction capabilities, we introduce , a benchmark featuring verified implementations, alongside comprehensive metrics for evaluating both reproduction and execution fidelity. Extensive evaluations on PaperBench and demonstrate that consistently surpasses existing baselines across all metrics. Notably, it yields substantial improvements in reproduction fidelity and final execution performance. The code is available at https://github.com/AI9Stars/AutoReproduce.
CheckRLM: Effective Knowledge–Thought Coherence Checking in Retrieval-Augmented Reasoning
Dingling Xu | Ruobing Wang | Qingfei Zhao | Yukun Yan | Zhichun Wang | Daren Zha | Shi Yu | Zhenghao Liu | Shuo Wang | Xu Han | Maosong Sun
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Dingling Xu | Ruobing Wang | Qingfei Zhao | Yukun Yan | Zhichun Wang | Daren Zha | Shi Yu | Zhenghao Liu | Shuo Wang | Xu Han | Maosong Sun
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Reasoning Language Models (RLMs) have significantly improved performance on complex tasks by extending the reasoning chain. However, these chains are prone to containing factual errors, particularly in knowledge-intensive tasks. To address this issue, we propose **CheckRLM**, a framework that improves the reliability of the reasoning process through Retrieval-Augmented Generation (RAG) by timely checking and correcting factual errors. Specifically, CheckRLM extracts factual claims from the reasoning chain to identify and localize subtle knowledge inconsistencies during inference. Upon detection of errors, a refinement mechanism performs minimal-cost yet precise corrections by leveraging external knowledge, ensuring coherence between the reasoning chain and correct knowledge. Extensive experiments demonstrate that CheckRLM substantially outperforms existing baselines, exhibiting a strong capability to mitigate error accumulation in long-horizon reasoning with lower costs. The code and data are available at https://github.com/AI9Stars/CheckRLM.
AutoVecCoder: Teaching LLMs to Generate Explicitly Vectorized Code
ShangZhan Li | Xinyu Yin | Xuanyu Jin | Ye He | Yuxin Zhou | Yuxuan Li | Xu Han | Wanxiang Che | Qi Shi | Ting Liu | Maosong Sun
Findings of the Association for Computational Linguistics: ACL 2026
ShangZhan Li | Xinyu Yin | Xuanyu Jin | Ye He | Yuxin Zhou | Yuxuan Li | Xu Han | Wanxiang Che | Qi Shi | Ting Liu | Maosong Sun
Findings of the Association for Computational Linguistics: ACL 2026
Vectorization via Single Instruction, Multiple Data (SIMD) architectures is a cornerstone of high-performance computing. To fully exploit hardware potential, developers often resort to explicit vectorization using intrinsics, as compiler-based auto-vectorization frequently yields suboptimal results due to conservative static analysis. While Large Language Models (LLMs) have demonstrated remarkable proficiency in general code generation, they struggle with explicit vectorization due to the scarcity of high-quality corpora and the strict semantic constraints of low-level hardware instructions. In this paper, we propose AutoVecCoder, a novel framework designed to empower LLMs with the capability of automated explicit vectorization. AutoVecCoder integrates two core components: VecPrompt, an automated data synthesis pipeline to inject domain-specific intrinsic knowledge; and VecRL, a reinforcement learning framework that aligns code generation with execution efficiency. AutoVecCoder-8B trained by this framework achieves state-of-the-art performance on the SSE and AVX subsets of SimdBench and, in some cases, generates implementations surpassing standard optimizations, effectively overcoming the inherent bottlenecks of traditional automated vectorization.
APB-V: Accelerating Long-Video Understanding via Sequence-Parallelism-aware Approximate Attention
Yuxiang Huang | Mingye Li | Xu Han | Chaojun Xiao | Weilin Zhao | Ao Sun | Ziqi Yuan | Hao Zhou | Fandong Meng | Zhiyuan Liu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Yuxiang Huang | Mingye Li | Xu Han | Chaojun Xiao | Weilin Zhao | Ao Sun | Ziqi Yuan | Hao Zhou | Fandong Meng | Zhiyuan Liu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
The efficiency of long-video inference remains a critical bottleneck, mainly due to the dense computation in the prefill stage of Large Multimodal Models (LMMs). Existing methods either compress visual embeddings or apply sparse attention on a single GPU, yielding limited acceleration or degraded performance and restricting LMMs from handling longer, more complex videos. To overcome these issues, we propose APB-V, a sequence-parallel framework with optimized attention that accelerates long-video inference across multiple GPUs. By distributing approximate attention, APB-V reduces computation and increases parallelism, enabling efficient processing of more visual embeddings without compression and thereby improving task performance. System-level optimizations, such as load balancing and fused forward passes, further unleash the potential of APB-V, delivering speedups of 12.72x, 1.70x, and 1.18x over FlashAttn, ZigZagRing, and APB, without notable performance loss.
Search
Fix author
Co-authors
- Maosong Sun (孙茂松) 4
- Zhiyuan Liu 3
- Yuxuan Li 2
- Qi Shi 2
- Shuo Wang 2
- Wanxiang Che (车万翔) 1
- Yingfa Chen 1
- Ye He 1
- Yuxiang Huang 1
- Xuanyu Jin 1
- ShangZhan Li 1
- Mingye Li 1
- Zhenghao Liu (刘正皓) 1
- Ting Liu 1
- Fandong Meng 1
- Zilin Sang 1
- Xingyu Shen 1
- Ao Sun 1
- Zhen Leng Thai 1
- Ruobing Wang 1
- Zhichun Wang (王志春) 1
- Chaojun Xiao 1
- Dingling Xu 1
- Yukun Yan (闫宇坤) 1
- Xinyu Yin 1
- Shi Yu (于是) 1
- Ziqi Yuan 1
- Daren Zha 1
- Duzhen Zhang 1
- Xuanle Zhao 1
- Weilun Zhao 1
- Qingfei Zhao 1
- Weilin Zhao 1
- Yuxin Zhou 1
- Hao Zhou 1