[2601.13518] AgenticRed: Evolving Agentic Systems for Red-Teaming

arXiv - AI April 06, 2026 3 min read

About this article

Abstract page for arXiv paper 2601.13518: AgenticRed: Evolving Agentic Systems for Red-Teaming

Computer Science > Artificial Intelligence arXiv:2601.13518 (cs) [Submitted on 20 Jan 2026 (v1), last revised 3 Apr 2026 (this version, v3)] Title:AgenticRed: Evolving Agentic Systems for Red-Teaming Authors:Jiayi Yuan, Jonathan Nöther, Natasha Jaques, Goran Radanović View a PDF of the paper titled AgenticRed: Evolving Agentic Systems for Red-Teaming, by Jiayi Yuan and 3 other authors View PDF HTML (experimental) Abstract:While recent automated red-teaming methods show promise for systematically exposing model vulnerabilities, most existing approaches rely on human-specified workflows. This dependence on manually designed workflows suffers from human biases and makes exploring the broader design space expensive. We introduce AgenticRed, an automated pipeline that leverages LLMs' in-context learning to iteratively design and refine red-teaming systems without human intervention. Rather than optimizing attacker policies within predefined structures, AgenticRed treats red-teaming as a system design problem, and it autonomously evolves automated red-teaming systems using evolutionary selection and generational knowledge. Red-teaming systems designed by AgenticRed consistently outperform state-of-the-art approaches, achieving 96% attack success rate (ASR) on Llama-2-7B, 98% on Llama-3-8B and 100% on Qwen3-8B on HarmBench. Our approach generates robust, query-agnostic red-teaming systems that transfer strongly to the latest proprietary models, achieving an impressive 100% ASR on...

Originally published on April 06, 2026. Curated by AI News.

Llms

Anthropic Restricts Claude Agent Access Amid AI Automation Boom in Crypto

AI Tools & Products · 7 min · 29 minutes ago

Llms

Is cutting ‘please’ when talking to ChatGPT better for the planet? An expert explains

AI Tools & Products · 5 min · 29 minutes ago

Llms

AI Desktop 98 lets you chat with Claude, ChatGPT, and Gemini through a Windows 98-inspired interface

AI Tools & Products · 3 min · 29 minutes ago

Llms

Claude, OpenClaw and the new reality: AI agents are here — and so is the chaos

AI Tools & Products · 29 minutes ago

[2601.13518] AgenticRed: Evolving Agentic Systems for Red-Teaming

About this article

Related Articles

Anthropic Restricts Claude Agent Access Amid AI Automation Boom in Crypto

Is cutting ‘please’ when talking to ChatGPT better for the planet? An expert explains

AI Desktop 98 lets you chat with Claude, ChatGPT, and Gemini through a Windows 98-inspired interface

Claude, OpenClaw and the new reality: AI agents are here — and so is the chaos

No comments

Stay updated with AI News