Rust-doctor: Enhanced Feature for Rust Ownership and Lifetime Repair with Balanced Training Data Generation

Wenzhang Yang; Xiaoning Ren; Cuifeng Gao; Yinxing Xue

doi:10.18653/v1/2025.findings-emnlp.179

Rust-doctor: Enhanced Feature for Rust Ownership and Lifetime Repair with Balanced Training Data Generation

Wenzhang Yang, Xiaoning Ren, Cuifeng Gao, Yinxing Xue

Abstract

As a relatively new programming language, Rust has gained significant popularity in recent years due to its safety features during compilation. However, Rust developers often face challenges stemming from its strict compilation checks due to the steep learning curve of safety rules. To make matters worse, the lack of training data and the unique semantics of Rust lead to poor performance in learning-based automated program repair techniques. To address these challenges, we propose a novel error injection approach to generate a balanced training dataset and leverage the Mid-level Intermediate Representation (MIR) as enhanced features for Rust’s unique compilation error repair. Using these innovations, we fine-tuned a new code model, LLaRRA: Large Language and Rust Repair Assistant. Experimental results demonstrate that LLaRRA significantly outperforms state-of-the-art models in terms of Pass@K and Acc@K.

Anthology ID:: 2025.findings-emnlp.179
Volume:: Findings of the Association for Computational Linguistics: EMNLP 2025
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 3366–3376
Language:
URL:: https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.179/
DOI:: 10.18653/v1/2025.findings-emnlp.179
Bibkey:
Cite (ACL):: Wenzhang Yang, Xiaoning Ren, Cuifeng Gao, and Yinxing Xue. 2025. Rust-doctor: Enhanced Feature for Rust Ownership and Lifetime Repair with Balanced Training Data Generation. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 3366–3376, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Rust-doctor: Enhanced Feature for Rust Ownership and Lifetime Repair with Balanced Training Data Generation (Yang et al., Findings 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/author-page-yu-wang-polytechnic/2025.findings-emnlp.179.pdf
Checklist:: 2025.findings-emnlp.179.checklist.pdf

PDF Cite Search Checklist Fix data