Junjie Wang
Other people with similar names: Junjie Wang
Unverified author pages with similar names: Junjie Wang
2026
ProFit: Leveraging High-Value Signals in SFT via Probability-Guided Token Selection
Tao Liu | Taiqiang Wu | Runming Yang | Shaoning Sun | Junjie Wang | Yujiu Yang
Findings of the Association for Computational Linguistics: ACL 2026
Tao Liu | Taiqiang Wu | Runming Yang | Shaoning Sun | Junjie Wang | Yujiu Yang
Findings of the Association for Computational Linguistics: ACL 2026
Supervised fine-tuning (SFT) is a fundamental post-training strategy to align Large Language Models (LLMs) with human intent. However, traditional SFT often ignores the one-to-many nature of language by forcing alignment with a single reference answer, leading to the model overfitting to non-core expressions. Although our empirical analysis suggests that introducing multiple reference answers can mitigate this issue, the prohibitive data and computational costs necessitate a strategic shift: prioritizing the mitigation of single-reference overfitting over the costly pursuit of answer diversity. To achieve this, we reveal the intrinsic connection between token probability and semantic importance: high-probability tokens carry the core logical framework, while low-probability tokens are mostly replaceable expressions. Based on this insight, we propose ProFit, which selectively masks low-probability tokens to prevent surface-level overfitting. Extensive experiments confirm that ProFit consistently outperforms traditional SFT baselines on general reasoning and mathematical benchmarks
A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code
Keke Lian | Wang Bin | Lei Zhang | Libo Chen | Junjie Wang | Ziming Zhao | Yujiu Yang | Miaoqian Lin | Haotong Duan | Haoran Zhao | Shuang Liao | Mingda Guo | Quan Jiazheng | Yilu Zhong | Chenhao He | Chen Zichuan | Jie Wu | Haoling Li | Zhaoxuan Li | Jiongchi Yu | Hui LI | Dong Zhang
Findings of the Association for Computational Linguistics: ACL 2026
Keke Lian | Wang Bin | Lei Zhang | Libo Chen | Junjie Wang | Ziming Zhao | Yujiu Yang | Miaoqian Lin | Haotong Duan | Haoran Zhao | Shuang Liao | Mingda Guo | Quan Jiazheng | Yilu Zhong | Chenhao He | Chen Zichuan | Jie Wu | Haoling Li | Zhaoxuan Li | Jiongchi Yu | Hui LI | Dong Zhang
Findings of the Association for Computational Linguistics: ACL 2026
The increasing adoption of large language models (LLMs) in software engineering necessitates rigorous security evaluation of their generated code. However, existing benchmarks often lack relevance to real-world AI-assisted programming scenarios, making them inadequate for assessing the practical security risks associated with AI-generated code in production environments. To address this gap, we introduce A.S.E (AI Code Generation Security Evaluation), a repository-level evaluation benchmark designed to closely mirror real-world AI programming tasks, offering a comprehensive and reliable framework for assessing the security of AI-generated code. Our evaluation of leading LLMs on A.S.E reveals several key findings. In particular, current LLMs still struggle with secure coding. The complexity in repository-level scenarios presents challenges for LLMs that typically perform well on snippet-level tasks. Moreover, a larger reasoning budget does not necessarily lead to better code generation. These observations offer valuable insights into the current state of AI code generation and help developers identify the most suitable models for practical tasks. They also lay the groundwork for refining LLMs to generate secure and efficient code in real-world applications.
2025
Chain-of-Reasoning: Towards Unified Mathematical Reasoning in Large Language Models via a Multi-Paradigm Perspective
Yiyao Yu | Yuxiang Zhang | Dongdong Zhang | Xiao Liang | Hengyuan Zhang | Xingxing Zhang | Mahmoud Khademi | Hany Hassan Awadalla | Junjie Wang | Yujiu Yang | Furu Wei
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Yiyao Yu | Yuxiang Zhang | Dongdong Zhang | Xiao Liang | Hengyuan Zhang | Xingxing Zhang | Mahmoud Khademi | Hany Hassan Awadalla | Junjie Wang | Yujiu Yang | Furu Wei
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Large Language Models (LLMs) have made notable progress in mathematical reasoning, yet they often rely on single-paradigm reasoning that limits their effectiveness across diverse tasks. In this paper, we introduce Chain-of-Reasoning (CoR), a novel unified framework that integrates multiple reasoning paradigms — Natural Language Reasoning (NLR), Algorithmic Reasoning (AR), and Symbolic Reasoning (SR) — to enable synergistic collaboration. CoR generates multiple potential answers using different reasoning paradigms and synthesizes them into a coherent final solution. We propose a Progressive Paradigm Training (PPT) strategy that allows models to progressively master these paradigms, culminating in the development of at CoR-Math-7B. Experimental results demonstrate that CoR-Math-7B significantly outperforms current SOTA models, achieving up to a 41.0% absolute improvement over GPT-4o in theorem proving tasks and a 15% improvement over RL-based methods on the MATH benchmark in arithmetic tasks. These results show the enhanced mathematical comprehensive ability of our model, enabling zero-shot generalization across tasks.The code is available at https://github.com/microsoft/CoR.
2023
UniEX: An Effective and Efficient Framework for Unified Information Extraction via a Span-extractive Perspective
Yang Ping | JunYu Lu | Ruyi Gan | Junjie Wang | Yuxiang Zhang | Pingjian Zhang | Jiaxing Zhang
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Yang Ping | JunYu Lu | Ruyi Gan | Junjie Wang | Yuxiang Zhang | Pingjian Zhang | Jiaxing Zhang
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
We propose a new paradigm for universal information extraction (IE) that is compatible with any schema format and applicable to a list of IE tasks, such as named entity recognition, relation extraction, event extraction and sentiment analysis. Our approach converts the text-based IE tasks as the token-pair problem, which uniformly disassembles all extraction targets into joint span detection, classification and association problems with a unified extractive framework, namely UniEX. UniEX can synchronously encode schema-based prompt and textual information, and collaboratively learn the generalized knowledge from pre-defined information using the auto-encoder language models. We develop a traffine attention mechanism to integrate heterogeneous factors including tasks, labels and inside tokens, and obtain the extraction target via a scoring matrix. Experiment results show that UniEX can outperform generative universal IE models in terms of performance and inference-speed on 14 benchmarks IE datasets with the supervised setting. The state-of-the-art performance in low-resource scenarios also verifies the transferability and effectiveness of UniEX.
Solving Math Word Problems via Cooperative Reasoning induced Language Models
Xinyu Zhu | Junjie Wang | Lin Zhang | Yuxiang Zhang | Yongfeng Huang | Ruyi Gan | Jiaxing Zhang | Yujiu Yang
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Xinyu Zhu | Junjie Wang | Lin Zhang | Yuxiang Zhang | Yongfeng Huang | Ruyi Gan | Jiaxing Zhang | Yujiu Yang
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Large-scale pre-trained language models (PLMs) bring new opportunities to challenging problems, especially those that need high-level intelligence, such as the math word problem (MWPs). However, directly applying existing PLMs to MWPs can fail as the generation process lacks sufficient supervision and thus lacks fast adaptivity as humans. We notice that human reasoning has a dual reasoning framework that consists of an immediate reaction system (system 1) and a delicate reasoning system (system 2), where the entire reasoning is determined by their interaction. This inspires us to develop a cooperative reasoning-induced PLM for solving MWPs, called Cooperative Reasoning (CoRe), resulting in a human-like reasoning architecture with system 1 as the generator and system 2 as the verifier. In our approach, the generator is responsible for generating reasoning paths, and the verifiers are used to supervise the evaluation in order to obtain reliable feedback for the generator. We evaluate our CoRe framework on several mathematical reasoning datasets and achieve decent improvement over state-of-the-art methods, up to 9.6% increase over best baselines.
Search
Fix author
Co-authors
- Yujiu Yang 4
- Yuxiang Zhang (张宇翔) 3
- Ruyi Gan 2
- Jiaxing Zhang 2
- Wang Bin 1
- Libo Chen 1
- Haotong Duan 1
- Mingda Guo 1
- Hany Hassan Awadalla 1
- Chenhao He 1
- Yongfeng Huang 1
- Quan Jiazheng 1
- Mahmoud Khademi 1
- Hui LI 1
- Haoling Li 1
- Zhaoxuan Li 1
- Keke Lian 1
- Xiao Liang (梁霄) 1
- Shuang Liao 1
- Miaoqian Lin 1
- Tao Liu (刘涛) 1
- Junyu Lu 1
- Yang Ping 1
- Shaoning Sun 1
- Furu Wei 1
- Taiqiang Wu 1
- Jie Wu 1
- Runming Yang 1
- Yiyao Yu 1
- Jiongchi Yu 1
- Dongdong Zhang 1
- Hengyuan Zhang 1
- Xingxing Zhang 1
- Lei Zhang 1
- Dong Zhang 1
- Pingjian Zhang 1
- Lin Zhang 1
- Ziming Zhao 1
- Haoran Zhao 1
- Yilu Zhong 1
- Xinyu Zhu 1
- Chen Zichuan 1