[2603.20155] Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD

arXiv:2603.20155 [cs.LG], submitted on 20 Mar 2026

Title: Beyond Single Tokens: Distilling Discrete Diffusion Models via Discrete MMD

Authors: Emiel Hoogeboom, David Ruhe, Jonathan Heek, Thomas Mensink, Tim Salimans

Abstract: It is currently difficult to distill discrete diffusion models. In contrast, the continuous diffusion literature has many distillation methods that can reduce sampling steps to a handful. Our method, Discrete Moment Matching Distillation (D-MMD), leverages ideas that have been highly successful in the continuous domain. Whereas previous discrete distillation methods collapse, D-MMD maintains high quality and diversity (given sufficient sampling steps). This is demonstrated on both text and image datasets. Moreover, the newly distilled generators can outperform their teachers.

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)

Cite as: arXiv:2603.20155 [cs.LG] (or arXiv:2603.20155v1 [cs.LG] for this version)

DOI: https://doi.org/10.48550/arXiv.2603.20155 (arXiv-issued DOI via DataCite, pending registration)

Submission history: [v1] Fri, 20 Mar 2026 17:29:12 UTC (801 KB), from Emiel Hoogeboom

Originally published on March 23, 2026. Curated by AI News.
