[2505.18646] SEW: Self-Evolving Agentic Workflows for Automated Code Generation

arXiv - AI April 15, 2026 3 min read

About this article

Computer Science > Software Engineering

arXiv:2505.18646 (cs)

[Submitted on 24 May 2025 (v1), last revised 14 Apr 2026 (this version, v2)]

Title: SEW: Self-Evolving Agentic Workflows for Automated Code Generation

Authors: Siwei Liu, Jinyuan Fang, Han Zhou, Yingxu Wang, Zaiqiao Meng

Abstract: Large Language Models (LLMs) have demonstrated effectiveness in code generation tasks. To enable LLMs to address more complex coding challenges, existing research has focused on crafting multi-agent systems with agentic workflows, where complex coding tasks are decomposed into sub-tasks, assigned to specialized agents. Despite their effectiveness, current approaches heavily rely on hand-crafted agentic workflows, with both agent topologies and prompts manually designed, which limits their ability to automatically adapt to different types of coding problems. To address these limitations and enable automated workflow design, we propose Self-Evolving Workflow (SEW), a novel self-evolving framework that automatically generates and optimises multi-agent workflows. Extensive experiments on three coding benchmark datasets, including the challenging LiveCodeBench, demonstrate that our SEW can automatically design agentic workflows and optimise them through self-evolution, bringing up to 12% improvement...
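The excerpt above is cut off before it explains how SEW evolves a workflow, so the following is only a toy sketch of the general idea of generating workflow variants and keeping the ones that score better. It is not the paper's algorithm. Every name here (Workflow, ROLE_POOL, mutate, evolve, toy_score) is hypothetical, the mutation operators are an assumption, and toy_score stands in for a real benchmark pass rate, which would require running the generated code against tests.

```python
import random
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass(frozen=True)
class Workflow:
    # Ordered agent roles; each role stands in for an agent and its prompt.
    roles: Tuple[str, ...]


# Hypothetical pool of agent roles that a mutation may draw from.
ROLE_POOL = ("planner", "coder", "tester", "reviewer")


def mutate(wf: Workflow, rng: random.Random) -> Workflow:
    """Return a variant of wf by adding, dropping, or swapping one role."""
    roles = list(wf.roles)
    op = rng.choice(("add", "drop", "swap"))
    if op == "add" or len(roles) < 2:
        # Very short workflows can only grow, so they never become empty.
        roles.insert(rng.randrange(len(roles) + 1), rng.choice(ROLE_POOL))
    elif op == "drop":
        roles.pop(rng.randrange(len(roles)))
    else:
        i, j = rng.sample(range(len(roles)), 2)
        roles[i], roles[j] = roles[j], roles[i]
    return Workflow(tuple(roles))


def evolve(
    seed_wf: Workflow,
    score: Callable[[Workflow], float],
    generations: int = 20,
    children: int = 4,
    seed: int = 0,
) -> Tuple[Workflow, float]:
    """Greedy hill climbing: keep a candidate only if it scores strictly higher."""
    rng = random.Random(seed)
    best, best_score = seed_wf, score(seed_wf)
    for _ in range(generations):
        for _ in range(children):
            candidate = mutate(best, rng)
            candidate_score = score(candidate)
            if candidate_score > best_score:
                best, best_score = candidate, candidate_score
    return best, best_score


def toy_score(wf: Workflow) -> float:
    # Assumed stand-in for a benchmark pass rate:
    # rewards distinct roles and penalises workflow length.
    return len(set(wf.roles)) - 0.1 * len(wf.roles)


if __name__ == "__main__":
    best, best_score = evolve(Workflow(("coder",)), toy_score)
    print(best.roles, best_score)
```

Because a candidate replaces the current best only when it scores strictly higher, the returned score can never fall below the seed workflow's score. A real system would likely need a more robust search than greedy hill climbing, since benchmark scores from LLM-generated code are noisy.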

Originally published on April 15, 2026. Curated by AI News.
