[D] Your Agent, Their Asset: Real-world safety evaluation of OpenClaw agents (CIK poisoning raises attack success to ~64–74%)

Reddit - Machine Learning · 1 min read

About this article

Paper: https://arxiv.org/abs/2604.04759

This paper presents a real-world safety evaluation of OpenClaw, a personal AI agent with access to Gmail, Stripe, and the local filesystem. The authors introduce a taxonomy of persistent agent state:

- Capability (skills / executable code)
- Identity (persona, trust configuration)
- Knowledge (memory)

They evaluate 12 attack scenarios on a live system across multiple models. Key results:

- Baseline attack success rate: ~10–36.7%
- After poisoning a sing...
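For illustration only, the paper's three categories of persistent agent state (Capability, Identity, Knowledge) can be pictured as a simple container that survives across sessions; the class and field names below are hypothetical sketches, not the authors' implementation:

```python
from dataclasses import dataclass, field

@dataclass
class AgentState:
    """Illustrative model of persistent agent state per the paper's taxonomy."""
    capability: list[str] = field(default_factory=list)    # skills / executable code
    identity: dict[str, str] = field(default_factory=dict)  # persona, trust configuration
    knowledge: list[str] = field(default_factory=list)     # memory entries

# Because this state persists between sessions, a single poisoned entry
# in any category keeps influencing the agent on every later run.
state = AgentState()
state.knowledge.append("attacker-injected memory entry")
```

The point of the sketch is simply that poisoning targets durable state rather than a single prompt, which is consistent with the large jump in attack success rates the paper reports.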


Originally published on April 08, 2026. Curated by AI News.

Related Articles

Anthropic's latest AI model identifies 'thousands of zero-day vulnerabilities' in 'every major operating system and every major web browser' — Claude Mythos Preview sparks race to fix critical bugs, some unpatched for decades
LLMs · AI Tools & Products · 6 min

Anthropic says its latest AI model is too powerful for public release and that it broke containment during testing
Machine Learning · AI Tools & Products · 5 min

Thinking small: How small language models could lessen the AI energy burden
According to researchers, for many industries, small language models may offer a host of advantages to energy- and resource-intensive lar...
LLMs · AI Tools & Products · 5 min

Anthropic says its most powerful AI cyber model is too dangerous to release publicly — so it built Project Glasswing
Machine Learning · AI Tools & Products
