Qingyu Ren
2025
Beyond Correctness: Confidence-Aware Reward Modeling for Enhancing Large Language Model Reasoning
Qianxi He
|
Qingyu Ren
|
Shanzhe Lei
|
Xuhong Wang
|
Yingchun Wang
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Order Matters: Investigate the Position Bias in Multi-constraint Instruction Following
Jie Zeng
|
Qianyu He
|
Qingyu Ren
|
Jiaqing Liang
|
Weikang Zhou
|
Zeye Sun
|
Fei Yu
|
Yanghua Xiao
Findings of the Association for Computational Linguistics: ACL 2025
Step-by-Step Mastery: Enhancing Soft Constraint Following Ability of Large Language Models
Qingyu Ren
|
Jie Zeng
|
Qianyu He
|
Jiaqing Liang
|
Yanghua Xiao
|
Weikang Zhou
|
Zeye Sun
|
Fei Yu
Findings of the Association for Computational Linguistics: ACL 2025