Llms Machine Learning Ai Infrastructure Ai Startups

[P] Gemma 4 running on NVIDIA B200 and AMD MI355X from the same inference stack, 15% throughput gain over vLLM on Blackwell

Reddit - Machine Learning April 02, 2026 1 min read

About this article

Google DeepMind dropped Gemma 4 today: Gemma 4 31B: dense, 256K context, redesigned architecture targeting efficiency and long-context quality Gemma 4 26B A4B: MoE, 26B total / 4B active per forward pass, 256K context Both are natively multimodal (text, image, video, dynamic resolution). We got both running on MAX on launch day across NVIDIA B200 and AMD MI355X from the same stack. On B200 we're seeing 15% higher output throughput vs. vLLM (happy to share more on methodology if useful). Free ...

You've been blocked by network security.To continue, log in to your Reddit account or use your developer tokenIf you think you've been blocked by mistake, file a ticket below and we'll look into it.Log in File a ticket

Originally published on April 02, 2026. Curated by AI News.

Read Original Article

Llms

Claude Source Code?

Has anyone been able to successfully download the leaked source code yet? I've not been able to find it. If anyone has, please reach out....

Reddit - Artificial Intelligence · 1 min · 4 minutes ago

Llms

[R] Solving the Jane Street Dormant LLM Challenge: A Systematic Approach to Backdoor Discovery

Submitted by: Adam Kruger Date: March 23, 2026 Models Solved: 3/3 (M1, M2, M3) + Warmup Background When we first encountered the Jane Str...

Reddit - Machine Learning · 1 min · 34 minutes ago

Llms

Cursor Launches a New AI Agent Experience to Take On Claude Code and Codex | WIRED

As Cursor launches the next generation of its product, the AI coding startup has to compete with OpenAI and Anthropic more directly than ...

Wired - AI · 8 min · about 2 hours ago

Llms

[P] Gemma 4 running on NVIDIA B200 and AMD MI355X from the same inference stack, 15% throughput gain over vLLM on Blackwell

About this article

Related Articles

Claude Source Code?

[R] Solving the Jane Street Dormant LLM Challenge: A Systematic Approach to Backdoor Discovery

Cursor Launches a New AI Agent Experience to Take On Claude Code and Codex | WIRED

Anthropic leak reveals Claude Code tracks user frustration and raises new questions about AI privacy

No comments

Stay updated with AI News