[2510.01478] Purrception: Variational Flow Matching for

[2510.01478] Purrception: Variational Flow Matching for Vector-Quantized Image Generation

arXiv - Machine Learning March 03, 2026 3 min read

About this article

Abstract page for arXiv paper 2510.01478: Purrception: Variational Flow Matching for Vector-Quantized Image Generation

Computer Science > Computer Vision and Pattern Recognition arXiv:2510.01478 (cs) [Submitted on 1 Oct 2025 (v1), last revised 1 Mar 2026 (this version, v3)] Title:Purrception: Variational Flow Matching for Vector-Quantized Image Generation Authors:Răzvan-Andrei Matişan, Vincent Tao Hu, Grigory Bartosh, Björn Ommer, Cees G. M. Snoek, Max Welling, Jan-Willem van de Meent, Mohammad Mahdi Derakhshani, Floor Eijkelboom View a PDF of the paper titled Purrception: Variational Flow Matching for Vector-Quantized Image Generation, by R\u{a}zvan-Andrei Mati\c{s}an and 8 other authors View PDF Abstract:We introduce Purrception, a variational flow matching approach for vector-quantized image generation that provides explicit categorical supervision while maintaining continuous transport dynamics. Our method adapts Variational Flow Matching to vector-quantized latents by learning categorical posteriors over codebook indices while computing velocity fields in the continuous embedding space. This combines the geometric awareness of continuous methods with the discrete supervision of categorical approaches, enabling uncertainty quantification over plausible codes and temperature-controlled generation. We evaluate Purrception on ImageNet-1k 256x256 generation. Training converges faster than both continuous flow matching and discrete flow matching baselines while achieving competitive FID scores with state-of-the-art models. This demonstrates that Variational Flow Matching can effectively bri...

Originally published on March 03, 2026. Curated by AI News.

Generative Ai

[2602.08277] PISCO: Precise Video Instance Insertion with Sparse Control

Abstract page for arXiv paper 2602.08277: PISCO: Precise Video Instance Insertion with Sparse Control

arXiv - AI · 4 min · about 16 hours ago

Machine Learning

[2511.18746] Any4D: Open-Prompt 4D Generation from Natural Language and Images

Abstract page for arXiv paper 2511.18746: Any4D: Open-Prompt 4D Generation from Natural Language and Images

arXiv - AI · 4 min · about 16 hours ago

Llms

[2512.14549] Dual-objective Language Models: Training Efficiency Without Overfitting

Abstract page for arXiv paper 2512.14549: Dual-objective Language Models: Training Efficiency Without Overfitting

arXiv - AI · 3 min · about 16 hours ago

Llms

[2510.21011] Generating the Modal Worker: A Cross-Model Audit of Race and Gender in LLM-Generated Personas Across 41 Occupations

Abstract page for arXiv paper 2510.21011: Generating the Modal Worker: A Cross-Model Audit of Race and Gender in LLM-Generated Personas A...

arXiv - AI · 4 min · about 16 hours ago

[2510.01478] Purrception: Variational Flow Matching for Vector-Quantized Image Generation

About this article

Related Articles

[2602.08277] PISCO: Precise Video Instance Insertion with Sparse Control

[2511.18746] Any4D: Open-Prompt 4D Generation from Natural Language and Images

[2512.14549] Dual-objective Language Models: Training Efficiency Without Overfitting

[2510.21011] Generating the Modal Worker: A Cross-Model Audit of Race and Gender in LLM-Generated Personas Across 41 Occupations

No comments

Stay updated with AI News