[2603.27524] Safer Builders, Risky Maintainers: A Comparative Study of Breaking Changes in Human vs Agentic PRs

Computer Science > Software Engineering
arXiv:2603.27524 (cs)
Submitted on 29 Mar 2026

Authors: K M Ferdous, Dipayan Banik, Kowshik Chowdhury, Shazibul Islam Shamim

Abstract: AI coding agents are increasingly integrated into modern software engineering workflows, actively collaborating with human developers to create pull requests (PRs) in open-source repositories. Although coding agents improve developer productivity, they often generate code with more bugs and security issues than human-authored code. While human-authored PRs often break backward compatibility, leading to breaking changes, the potential for agentic PRs to introduce breaking changes remains underexplored. The goal of this paper is to help developers and researchers evaluate the reliability of AI-generated PRs by examining the frequency and task contexts in which AI agents introduce breaking changes. We conduct a comparative analysis of 7,191 agent-generated PRs with 1,402 human-authored PRs from Python repositories in the AIDev dataset. We develop a tool that analyzes code changes in commits corresponding to the agentic PRs and leverages an abstract syntax tree (AST) based analysis to detect potential breaking c...
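The abstract mentions AST-based detection of potential breaking changes, but this excerpt does not describe how the authors' tool works. The following is a minimal sketch of the general idea only, not the paper's implementation. It uses Python's standard `ast` module to compare the public top-level functions of two versions of a module. The function names `public_signatures` and `potential_breaking_changes`, and the two rules checked (removed functions, altered existing parameters), are assumptions chosen for illustration.

```python
import ast


def public_signatures(source: str) -> dict[str, list[str]]:
    """Map each public top-level function name to its positional parameter names."""
    signatures = {}
    for node in ast.parse(source).body:
        is_function = isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        if is_function and not node.name.startswith("_"):
            args = node.args
            signatures[node.name] = [a.arg for a in args.posonlyargs + args.args]
    return signatures


def potential_breaking_changes(old_source: str, new_source: str) -> list[str]:
    """Hypothetical helper: list API changes that could break existing callers."""
    old = public_signatures(old_source)
    new = public_signatures(new_source)
    findings = []
    for name, params in old.items():
        if name not in new:
            # Callers importing this name would fail.
            findings.append(f"removed function: {name}")
        elif new[name][: len(params)] != params:
            # An existing parameter was removed, renamed, or reordered.
            findings.append(f"changed parameters: {name}")
    return findings


if __name__ == "__main__":
    before = "def load(path, mode):\n    pass\n\ndef save(path):\n    pass\n"
    after = "def load(path):\n    pass\n"
    for finding in potential_breaking_changes(before, after):
        print(finding)
```

This sketch deliberately ignores classes, methods, keyword-only parameters, default values, and newly added required parameters, so it would both miss and over-report changes relative to a real analysis.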

Originally published on March 31, 2026. Curated by AI News.
