Xuecheng Wu
2026
A Survey of Inductive Reasoning for Large Language Models
Kedi Chen | Dezhao Ruan | Yuhao Dan | Yaoting Wang | Siyu Yan | Xuecheng Wu | Yinqi Zhang | Qin Chen | Jie Zhou | Liang He | Biqing Qi | Linyang Li | Qipeng Guo | Xiaoming Shi | Wei Zhang
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Kedi Chen | Dezhao Ruan | Yuhao Dan | Yaoting Wang | Siyu Yan | Xuecheng Wu | Yinqi Zhang | Qin Chen | Jie Zhou | Liang He | Biqing Qi | Linyang Li | Qipeng Guo | Xiaoming Shi | Wei Zhang
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Reasoning is an important task for large language models (LLMs). Among all the reasoning paradigms, inductive reasoning is one of the basic types, which is characterized by its particular-to-general thinking process and the non-uniqueness of its answers. The inductive mode is crucial for knowledge generalization and aligns better with human cognition, so it is a fundamental mode of learning, hence attracting increasing interest. Despite the importance of inductive reasoning, there is no systematic summary of it. Therefore, this paper presents the first comprehensive survey of inductive reasoning for LLMs. First, methods for improving inductive reasoning are categorized into three main areas: post-training enhancement, test-time exploration, and data augmentation. Then, current benchmarks of inductive reasoning are summarized, and a unified sandbox-based evaluation approach with the observation coverage metric is derived. Finally, we offer some analyses regarding the source of inductive ability and how simple model architectures and data help with inductive tasks, providing a solid foundation for future research.
TiKMiX: Efficient Semi-Dynamic Data Mixture via Data Influence for LLM Pre-training
Yifan Wang | Binbinliu | Fengze Liu | Yuanfan Guo | Jiyao Deng | Xuecheng Wu | Weidong Zhou | Xiaohuan Zhou | Taifeng Wang
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Yifan Wang | Binbinliu | Fengze Liu | Yuanfan Guo | Jiyao Deng | Xuecheng Wu | Weidong Zhou | Xiaohuan Zhou | Taifeng Wang
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
The data mixture used in the pre-training of a language model is a cornerstone of its final performance. Static data mixing strategies in Large Language Model (LLM) pre-training are often suboptimal as they fail to adapt to the model’s evolving learning states. Conversely, fully online dynamic updates, while adaptive, incur prohibitive computational costs. To bridge this gap, we propose TiKMiX, an efficient semi-dynamic data mixing framework. Our approach is grounded in a key observation of influence ranking invariance: the relative importance of data domains exhibits strong temporal stability over long training intervals. Leveraging this insight, we propose Group Influence, an efficient approach for quantifying domain impact, and formulate data mixing as a periodic, low-overhead influence maximization problem. Compared with REGMIX, the proposed method reduces computational overhead by 80% and achieves an average performance gain of 2% across nine downstream benchmarks, thereby effectively mitigating data under-digestion.
2025
MUSE: MCTS-Driven Red Teaming Framework for Enhanced Multi-Turn Dialogue Safety in Large Language Models
Siyu Yan | Long Zeng | Xuecheng Wu | Chengcheng Han | Kongcheng Zhang | Chong Peng | Xuezhi Cao | Xunliang Cai | Chenjuan Guo
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Siyu Yan | Long Zeng | Xuecheng Wu | Chengcheng Han | Kongcheng Zhang | Chong Peng | Xuezhi Cao | Xunliang Cai | Chenjuan Guo
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
As large language models (LLMs) become widely adopted, ensuring their alignment with human values is crucial to prevent jailbreaks where adversaries manipulate models to produce harmful content. While most defenses target single-turn attacks, real-world usage often involves multi-turn dialogues, exposing models to attacks that exploit conversational context to bypass safety measures. We introduce MUSE, a comprehensive framework tackling multi-turn jailbreaks from both attack and defense angles. For attacks, we propose MUSE-A, a method that uses frame semantics and heuristic tree search to explore diverse semantic trajectories. For defense, we present MUSE-D, a fine-grained safety alignment approach that intervenes early in dialogues to reduce vulnerabilities. Extensive experiments on various models show that MUSE effectively identifies and mitigates multi-turn vulnerabilities. Code is available at https://anonymous.4open.science/r/MUSE-75F7.
Search
Fix author
Co-authors
- Siyu Yan 2
- Binbinliu 1
- Xunliang Cai 1
- Xuezhi Cao 1
- Kedi Chen 1
- Qin Chen 1
- Yuhao Dan 1
- Jiyao Deng 1
- Qipeng Guo 1
- Chenjuan Guo 1
- Yuanfan Guo 1
- Chengcheng Han 1
- Liang He 1
- Linyang Li 1
- Fengze Liu 1
- Chong Peng 1
- Biqing Qi 1
- Dezhao Ruan 1
- Xiaoming Shi 1
- Yaoting Wang 1
- Yifan Wang 1
- Taifeng Wang 1
- Long Zeng 1
- Yinqi Zhang 1
- Wei Zhang 1
- Kongcheng Zhang 1
- Jie Zhou 1
- Weidong Zhou 1
- Xiaohuan Zhou 1