Llms Open Source Ai Machine Learning Nlp

Arc Gate —LLM proxy that hits P=1.00 R=1.00 F1=1.00 on indirect/roleplay prompt injection (beats OpenAI Moderation and LlamaGuard)

Reddit - Artificial Intelligence April 28, 2026 1 min read

About this article

Benchmarked on 40 out-of-distribution prompts, indirect requests, roleplay framings, hypothetical scenarios, technical phrasings. The stuff that slips past everything else. Arc Gate: P=1.00, R=1.00, F1=1.00 OpenAI Moderation API: P=1.00, R=0.75, F1=0.86 LlamaGuard 3 8B: P=1.00, R=0.55, F1=0.71 Zero false positives. Zero misses. Blocked prompts average 329ms and never reach your model. Detection overhead is ~350ms on top of your normal upstream latency. Sits in front of any OpenAI-compatible e...

You've been blocked by network security.To continue, log in to your Reddit account or use your developer tokenIf you think you've been blocked by mistake, file a ticket below and we'll look into it.Log in File a ticket

Originally published on April 28, 2026. Curated by AI News.

Read Original Article

Llms

Claude can now plug directly into Photoshop, Blender, and Ableton | The Verge

Anthropic has launched a set of connectors for Claude that allow the AI chatbot to tap into popular creative software

The Verge - AI · 4 min · about 2 hours ago

Llms

Built a multiplayer map where you can see everyone's Claude Code activity as creatures battling it out

Hello r/artificial I built this specifically for Claude Code users - every prompt you run feeds a digital pet called a Prompt Creature. T...

Reddit - Artificial Intelligence · 1 min · about 4 hours ago

Llms

The loss curve said tie. The judges said otherwise. Seeking replication for an early LLM training result [R]

TL;DR - I've written two novel functions that shape the training signal for LLMs. Early tests show people prefer responses from models tr...

Reddit - Machine Learning · 1 min · about 5 hours ago

Llms

Karpathy dropped a 200-line GPT, so I used the math to turn pandas DataFrames into searchable context windows and open sourced it (and automated my stats pipeline). [P]

TL;DR: I got tired of manually running Shapiro-Wilk tests and copy-pasting p-values at 2 AM. I built an open-source, async Python pipelin...

Arc Gate —LLM proxy that hits P=1.00 R=1.00 F1=1.00 on indirect/roleplay prompt injection (beats OpenAI Moderation and LlamaGuard)

About this article

Related Articles

Claude can now plug directly into Photoshop, Blender, and Ableton | The Verge

Built a multiplayer map where you can see everyone's Claude Code activity as creatures battling it out

The loss curve said tie. The judges said otherwise. Seeking replication for an early LLM training result [R]

Karpathy dropped a 200-line GPT, so I used the math to turn pandas DataFrames into searchable context windows and open sourced it (and automated my stats pipeline). [P]

No comments

Stay updated with AI News