Welcome Gemma 4: Frontier multimodal intelligence on device
Published April 2, 2026

By merve, Pedro Cuenca (pcuenq), Sergio Paniego (sergiopaniego), Ben Burtenshaw (burtenshaw), Steven Zheng (Steveeeeeeen), and Alvaro Bartolome (alvarobartt)

The Gemma 4 family of multimodal models by Google DeepMind is out on Hugging Face, with support for your favorite agents, inference engines, and fine-tuning libraries 🤗 These models are the real deal: truly open with Apache 2.0 licenses, high quality with Pareto-frontier arena scores, multimodal including audio, and available in sizes you can use everywhere, including on-device.

Gemma 4 builds on advances from previous model families and makes them click together. In our tests with pre-release checkpoints, we were so impressed by the models' out-of-the-box quality that we struggled to find fine-tuning examples that meaningfully improved on it. We collaborated with Google and the community to make the models available everywhere: transformers, llama.cpp, MLX, WebGPU, Rust; you name it. This blog post shows you how to build with your favorite tools, so let us know what you think! For a quick first taste, see the transformers sketch right after the table of contents.

Table of Contents

- What is New with Gemma 4?
- Overview of Capabilities and Architecture
  - Architecture at a Glance
  - Per-Layer Embeddings (PLE)
  - Shared KV Cache
  - Multimodal Capabilities
- Deploy Anywhere
  - transformers
  - Llama.cpp
  - Plug in to your local agent
  - transformers.js
  - MLX
  - Mistral.rs
- Fine-tuning & Demos
  - Fine-tuni…
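Before the tool-by-tool walkthrough, here is a minimal sketch of what running a Gemma 4 checkpoint with the transformers `pipeline` API might look like. The model id below is a placeholder, not a confirmed checkpoint name (check the model cards on the Hub for the actual ids), and the chat-style input assumes a recent transformers release that accepts message lists in text-generation pipelines.

```python
# Minimal sketch, assuming a recent transformers release with chat-aware
# text-generation pipelines. The model id is a placeholder; look up the
# real Gemma 4 checkpoint names on the Hugging Face Hub.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="google/gemma-4-small-it",  # hypothetical id for illustration
    device_map="auto",                # place weights on available GPU(s)/CPU
)

messages = [
    {"role": "user", "content": "Explain per-layer embeddings in one sentence."},
]

# The pipeline applies the model's chat template, generates a reply, and
# returns the full conversation; the last message is the assistant's answer.
result = generator(messages, max_new_tokens=128)
print(result[0]["generated_text"][-1]["content"])
```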