[2603.01068] LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model
Computer Science > Computer Vision and Pattern Recognition

arXiv:2603.01068 (cs) [Submitted on 1 Mar 2026]

Title: LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model
Authors: Zebin You, Xiaolu Zhang, Jun Zhou, Chongxuan Li, Ji-Rong Wen

Abstract: We present LLaDA-o, an effective and length-adaptive omni diffusion model for multimodal understanding and generation. LLaDA-o is built on a Mixture of Diffusion (MoD) framework that decouples discrete masked diffusion for text understanding from continuous diffusion for visual generation, while coupling them through a shared, simple, and efficient attention backbone that reduces redundant computation for fixed conditions. Building on MoD, we further introduce a data-centric length-adaptation strategy that enables flexible-length decoding in multimodal settings without architectural changes. Extensive experiments show that LLaDA-o achieves state-of-the-art performance among omni diffusion models on multimodal understanding and generation benchmarks, and reaches 87.04 on DPG-Bench for text-to-image generation, supporting the effectiveness of unified omni diffusion modeling. Code is available at this https URL.

Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as: arXiv:2603.01068 [cs.CV] (or arXiv:2603.01068v1 [cs.CV] for this version)
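
The MoD design sketched in the abstract, discrete masked diffusion on text tokens and continuous diffusion on image latents coupled through one shared attention trunk, can be illustrated with a minimal PyTorch-style sketch. Everything below is an illustrative assumption rather than the paper's released code: the module names (SharedBackbone, mod_step), the dimensions, the epsilon-prediction objective, and the noise schedule are all placeholders chosen to make the decoupled-losses / shared-backbone idea concrete.

import torch
import torch.nn as nn
import torch.nn.functional as F

VOCAB, MASK_ID, DIM, IMG_DIM = 32000, 31999, 512, 16  # toy sizes, assumed

class SharedBackbone(nn.Module):
    """One transformer trunk processes both modalities (the 'coupling')."""
    def __init__(self):
        super().__init__()
        layer = nn.TransformerEncoderLayer(DIM, nhead=8, batch_first=True)
        self.trunk = nn.TransformerEncoder(layer, num_layers=4)
        self.tok_emb = nn.Embedding(VOCAB, DIM)   # discrete text tokens
        self.img_proj = nn.Linear(IMG_DIM, DIM)   # continuous image latents
        self.text_head = nn.Linear(DIM, VOCAB)    # recovers masked tokens
        self.eps_head = nn.Linear(DIM, IMG_DIM)   # predicts injected noise

    def forward(self, text_ids, img_latents):
        h = torch.cat([self.tok_emb(text_ids), self.img_proj(img_latents)], dim=1)
        h = self.trunk(h)
        t_len = text_ids.size(1)
        return self.text_head(h[:, :t_len]), self.eps_head(h[:, t_len:])

def mod_step(model, text_ids, img_latents):
    # Discrete masked diffusion on text: mask a random fraction of tokens
    # and train the model to recover them (cross-entropy on masked slots).
    ratio = torch.rand(())
    masked = torch.rand_like(text_ids, dtype=torch.float) < ratio
    masked[:, 0] = True  # guarantee at least one masked slot in this toy loss
    corrupted = text_ids.masked_fill(masked, MASK_ID)

    # Continuous diffusion on image latents: add Gaussian noise at a random
    # level and train the model to predict that noise (epsilon objective).
    t = torch.rand(img_latents.size(0), 1, 1)
    eps = torch.randn_like(img_latents)
    noisy = (1 - t).sqrt() * img_latents + t.sqrt() * eps

    logits, eps_pred = model(corrupted, noisy)
    text_loss = F.cross_entropy(logits[masked], text_ids[masked])
    img_loss = F.mse_loss(eps_pred, eps)
    return text_loss + img_loss

model = SharedBackbone()
loss = mod_step(model, torch.randint(0, VOCAB - 1, (2, 12)), torch.randn(2, 64, IMG_DIM))
loss.backward()

In this sketch the two losses never mix gradients except through the shared trunk, which loosely mirrors the abstract's point that fixed conditions (e.g., the text prompt during image generation) can be encoded once by the common backbone instead of being re-processed per modality.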