Jiaming Guo
2026
QiMeng-PRepair: Precise Code Repair via Edit-Aware Reward Optimization
Changxin Ke | Rui Zhang | Jiaming Guo | Yuanbo Wen | Li Ding | Shuo Wang | Xuyuan Zhu | Xiong Peng | Di Huang | Zidong Du | Xing Hu | Qi Guo | Yunji Chen
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Changxin Ke | Rui Zhang | Jiaming Guo | Yuanbo Wen | Li Ding | Shuo Wang | Xuyuan Zhu | Xiong Peng | Di Huang | Zidong Du | Xing Hu | Qi Guo | Yunji Chen
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Large Language Models (LLMs) achieve strong program repair performance but often suffer from over-editing, where excessive modifications overwrite correct code and hinder bug localization. We systematically quantify its impact and introduce precise repair task, which maximizes reuse of correct code while fixing only buggy parts. Building on this insight, we propose PRepair, a framework that mitigates over-editing and improves repair accuracy. PRepair has two components: Self-Breaking, which generates diverse buggy programs via controlled bug injection and min–max sampling, and Self-Repairing, which trains models with Edit-Aware Group Relative Policy Optimization (EA-GRPO) using an edit-aware reward to encourage minimal yet correct edits. Experiments show that PRepair improves repair precision by up to 31.4% under fix1@1, a metric that jointly considers repair correctness and extent, and significantly increases decoding throughput when combined with speculative editing, demonstrating its potential for precise and practical code repair.
2024
Ex3: Automatic Novel Writing by Extracting, Excelsior and Expanding
Huang Lei | Jiaming Guo | Guanhua He | Xishan Zhang | Rui Zhang | Shaohui Peng | Shaoli Liu | Tianshi Chen
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Huang Lei | Jiaming Guo | Guanhua He | Xishan Zhang | Rui Zhang | Shaohui Peng | Shaoli Liu | Tianshi Chen
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Generating long-term texts such as novels using artificial intelligence has always been a challenge. A common approach is to use large language models (LLMs) to construct a hierarchical framework that first plans and then writes. Despite the fact that the generated novels reach a sufficient length, they exhibit poor logical coherence and appeal in their plots and deficiencies in character and event depiction, ultimately compromising the overall narrative quality. In this paper, we propose a method named Extracting Excelsior and Expanding. Ex3 initially extract structural information by learning from raw novel data. By combining this structure information with the novel data, an instruction-following dataset is meticulously crafted. This dataset is then utilized to fine-tune the LLM, aiming for excelsior generation performance. In the final stage, a tree-like expansion method is deployed to facilitate the generation of arbitrarily long novels.Evaluation against previous methods showcases Ex3’s ability to produce higher-quality long-form novels.