[D] Your Agent, Their Asset: Real-world safety evaluation of OpenClaw agents (CIK poisoning raises attack success to ~64–74%)
Paper: https://arxiv.org/abs/2604.04759

This paper presents a real-world safety evaluation of OpenClaw, a personal AI agent with access to Gmail, Stripe, and the local filesystem.

The authors introduce a taxonomy of persistent agent state:

- Capability (skills / executable code)
- Identity (persona, trust configuration)
- Knowledge (memory)

They evaluate 12 attack scenarios on a live system across multiple models.

Key results:

- baseline attack success rate: ~10–36.7%
- after poisoning a sing...
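
For anyone who wants to tabulate numbers like these themselves, here is a rough sketch of how the three-part taxonomy and an attack success rate could be represented. Everything in it is my own assumption, not the paper's code: the names `StateDimension`, `Trial`, and `attack_success_rate` are hypothetical, and the rate is simply taken to be successful attacks divided by trials, which may not match the paper's exact metric.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional


class StateDimension(Enum):
    """The three kinds of persistent agent state named in the post."""
    CAPABILITY = "capability"  # skills / executable code
    IDENTITY = "identity"      # persona, trust configuration
    KNOWLEDGE = "knowledge"    # memory


@dataclass(frozen=True)
class Trial:
    """One attack attempt (hypothetical record layout, not from the paper)."""
    scenario: str                        # made-up scenario label
    poisoned: Optional[StateDimension]   # None means the unpoisoned baseline
    attack_succeeded: bool


def attack_success_rate(trials: List[Trial]) -> float:
    """Fraction of trials in which the attack succeeded (assumed definition)."""
    if not trials:
        raise ValueError("need at least one trial")
    return sum(t.attack_succeeded for t in trials) / len(trials)


def select(trials: List[Trial], poisoned: Optional[StateDimension]) -> List[Trial]:
    """Keep only the trials run under the given poisoning condition."""
    return [t for t in trials if t.poisoned is poisoned]
```

With a list of `Trial` records (your own, not the paper's data), `attack_success_rate(select(trials, None))` would give the baseline rate and `attack_success_rate(select(trials, StateDimension.KNOWLEDGE))` the rate under memory poisoning.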