[2411.00759] Minibatch Optimal Transport and Perplexity Bound Estimation in Discrete Flow Matching

[2411.00759] Minibatch Optimal Transport and Perplexity Bound Estimation in Discrete Flow Matching

arXiv - Machine Learning 4 min read Article

Summary

This paper introduces a novel approach to discrete flow matching using minibatch optimal transport, enhancing generative performance while minimizing state transitions.

Why It Matters

The research addresses limitations in discrete flow matching, a growing area in machine learning. By proposing efficient methods to optimize transitions and perplexity estimation, it contributes to advancements in generative models, which are crucial for applications in natural language processing and beyond.

Key Takeaways

  • Introduces a dynamic-optimal-transport-like minimization objective for discrete flows.
  • Demonstrates up to a 32-fold reduction in transitions to achieve similar generative perplexity.
  • Proposes two upper bounds on perplexity for improved training and evaluation.
  • Introduces Multimask Flows that outperform existing models in generative perplexity.
  • Highlights the importance of minibatch strategies in optimizing transport costs.

Computer Science > Machine Learning arXiv:2411.00759 (cs) [Submitted on 1 Nov 2024 (v1), last revised 23 Feb 2026 (this version, v4)] Title:Minibatch Optimal Transport and Perplexity Bound Estimation in Discrete Flow Matching Authors:Etrit Haxholli, Yeti Z. Gurbuz, Ogul Can, Eli Waxman View a PDF of the paper titled Minibatch Optimal Transport and Perplexity Bound Estimation in Discrete Flow Matching, by Etrit Haxholli and 3 other authors View PDF HTML (experimental) Abstract:Discrete flow matching, a recent framework for modeling categorical data, has shown competitive performance with autoregressive models. However, unlike continuous flow matching, the rectification strategy cannot be applied due to the stochasticity of discrete paths, necessitating alternative methods to minimize state transitions. We propose a dynamic-optimal-transport-like minimization objective and derive its Kantorovich formulation for discrete flows with convex interpolants, where transport cost depends solely on inter-state similarity and can be optimized via minibatch strategies. We show that such methods can reduce the number of transitions up to 32 times (1024 to 32) to reach the same generative perplexity without compromising diversity. Additionally, path nondeterminism in discrete flows precludes an instantaneous change-of-variables analogue, preventing precise probability estimation available to continuous flows. We therefore propose two upper bounds on perplexity, enabling principled traini...

Related Articles

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Machine Learning

[D] Physicist-turned-ML-engineer looking to get into ML research. What's worth working on and where can I contribute most?

After years of focus on building products, I'm carving out time to do independent research again and trying to find the right direction. ...

Reddit - Machine Learning · 1 min ·
PSA: Anyone with a link can view your Granola notes by default | The Verge
Machine Learning

PSA: Anyone with a link can view your Granola notes by default | The Verge

Granola, the AI-powered note-taking app, makes your notes viewable by anyone with a link by default. It also turns on AI training for any...

The Verge - AI · 5 min ·
Machine Learning

[D] On-Device Real-Time Visibility Restoration: Deterministic CV vs. Quantized ML Models. Looking for insights on Edge Preservation vs. Latency.

Hey everyone, We have been working on a real-time camera engine for iOS that currently uses a purely deterministic Computer Vision approa...

Reddit - Machine Learning · 1 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime