[2603.01048] RepoRepair: Leveraging Code Documentation for Repository-Level Automated Program Repair
About this article
Abstract page for arXiv paper 2603.01048: RepoRepair: Leveraging Code Documentation for Repository-Level Automated Program Repair
Computer Science > Software Engineering arXiv:2603.01048 (cs) [Submitted on 1 Mar 2026] Title:RepoRepair: Leveraging Code Documentation for Repository-Level Automated Program Repair Authors:Zhongqiang Pan, Chuanyi Li, Wenkang Zhong, Yi Feng, Bin Luo, Vincent Ng View a PDF of the paper titled RepoRepair: Leveraging Code Documentation for Repository-Level Automated Program Repair, by Zhongqiang Pan and 5 other authors View PDF HTML (experimental) Abstract:Automated program repair (APR) struggles to scale from isolated functions to full repositories, as it demands a global, task-aware understanding to locate necessary changes. Current methods, limited by context and reliant on shallow retrieval or costly agent iterations, falter on complex cross-file issues. To this end, we propose RepoRepair, a novel documentation-enhanced approach for repository-level fault localization and program repair. Our core insight is to leverage LLMs to generate hierarchical code documentation (from functions to files) for code repositories, creating structured semantic abstractions that enable LLMs to comprehend repository-level context and dependencies. Specifically, RepoRepair first employs a text-based LLM (e.g., DeepSeek-V3) to generate file/function-level code documentation for repositories, which serves as auxiliary knowledge to guide fault localization. Subsequently, based on the fault localization results and the issue description, a powerful LLM (e.g., Claude-4) attempts to repair the ide...