[2603.27524] Safer Builders, Risky Maintainers: A Comparative Study of

[2603.27524] Safer Builders, Risky Maintainers: A Comparative Study of Breaking Changes in Human vs Agentic PRs

arXiv - AI March 31, 2026 4 min read

About this article

Abstract page for arXiv paper 2603.27524: Safer Builders, Risky Maintainers: A Comparative Study of Breaking Changes in Human vs Agentic PRs

Computer Science > Software Engineering arXiv:2603.27524 (cs) [Submitted on 29 Mar 2026] Title:Safer Builders, Risky Maintainers: A Comparative Study of Breaking Changes in Human vs Agentic PRs Authors:K M Ferdous, Dipayan Banik, Kowshik Chowdhury, Shazibul Islam Shamim View a PDF of the paper titled Safer Builders, Risky Maintainers: A Comparative Study of Breaking Changes in Human vs Agentic PRs, by K M Ferdous and 3 other authors View PDF HTML (experimental) Abstract:AI coding agents are increasingly integrated into modern software engineering workflows, actively collaborating with human developers to create pull requests (PRs) in open-source repositories. Although coding agents improve developer productivity, they often generate code with more bugs and security issues than human-authored code. While human-authored PRs often break backward compatibility, leading to breaking changes, the potential for agentic PRs to introduce breaking changes remains underexplored. The goal of this paper is to help developers and researchers evaluate the reliability of AI-generated PRs by examining the frequency and task contexts in which AI agents introduce breaking changes. We conduct a comparative analysis of 7,191 agent-generated PRs with 1402 human-authored PRs from Python repositories in the AIDev dataset. We develop a tool that analyzes code changes in commits corresponding to the agentic PRs and leverages an abstract syntax tree (AST) based analysis to detect potential breaking c...

Originally published on March 31, 2026. Curated by AI News.

Ai Agents

Considering NeurIPS submission [D]

Wondering if it worth submitting paper I’m working on to NeurIPS. I have formal mathematical proof for convergence of a novel agentic sys...

Reddit - Machine Learning · 1 min · about 5 hours ago

Ai Agents

Agent frameworks waste ~350,000+ tokens per session resending static files. 95% reduction benchmarked.

Measured the actual token waste on a local Qwen 3.5 122B setup. The numbers are unreal. Found a compile-time approach that cuts query con...

Reddit - Artificial Intelligence · 1 min · about 10 hours ago

Ai Agents

OpenClaw gives users yet another reason to be freaked out about security - Ars Technica

The viral AI agentic tool let attackers silently gain admin unauthenticated access.

Ars Technica - AI · 5 min · about 13 hours ago

Robotics

What happens when you let AI agents run a sitcom 24/7 with zero human involvement

Ran an experiment — gave AI agents full control over writing, character creation, and performing a sitcom. Left it running nonstop for ov...

Reddit - Artificial Intelligence · 1 min · about 15 hours ago

[2603.27524] Safer Builders, Risky Maintainers: A Comparative Study of Breaking Changes in Human vs Agentic PRs

About this article

Related Articles

Considering NeurIPS submission [D]

Agent frameworks waste ~350,000+ tokens per session resending static files. 95% reduction benchmarked.

OpenClaw gives users yet another reason to be freaked out about security - Ars Technica

What happens when you let AI agents run a sitcom 24/7 with zero human involvement

No comments

Stay updated with AI News