[2604.00007] Dynin-Omni: Omnimodal Unified Large Diffusion Language Model
Computer Science > Computation and Language
arXiv:2604.00007 (cs)
[Submitted on 9 Mar 2026]

Title: Dynin-Omni: Omnimodal Unified Large Diffusion Language Model
Authors: Jaeik Kim, Woojin Kim, Jihwan Hong, Yejoon Lee, Sieun Hyeon, Mintaek Lim, Yunseok Han, Dogeun Kim, Hoeun Lee, Hyunggeun Kim, Jaeyoung Do

Abstract: We present Dynin-Omni, the first masked-diffusion-based omnimodal foundation model that unifies text, image, and speech understanding and generation, together with video understanding, within a single architecture. Unlike autoregressive unified models that serialize heterogeneous modalities, or compositional unified models that require orchestration with external modality-specific decoders, Dynin-Omni natively formulates omnimodal modeling as masked diffusion over a shared discrete token space, enabling iterative refinement under bidirectional context. Dynin-Omni adopts a multi-stage training strategy with model-merging-based modality expansion and omnimodal alignment. We evaluate Dynin-Omni across 19 multimodal benchmarks spanning language reasoning, image generation and editing, video understanding, and speech recognition and synthesis. Dynin-Omni achieves 87.6 on GSM8K, 1733.6 on MME-P, 61.4 on VideoMME, 0.87 on GenEval, and 2.1 WER on LibriSpeech test-clean, consistently outperforming existing open-source uni...
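The abstract describes generation as masked diffusion over a shared discrete token space with iterative refinement under bidirectional context. The sketch below is a minimal, generic illustration of that style of decoding loop (confidence-based iterative unmasking), not the paper's actual implementation; the `model` interface, function names, and the unmasking schedule are assumptions for illustration only.

```python
import torch

def masked_diffusion_decode(model, prompt_ids, gen_len, mask_id, num_steps=8):
    """Iteratively refine a fully masked continuation under bidirectional context.

    Assumes `model` maps a (1, seq_len) token tensor to (1, seq_len, vocab) logits
    over a shared discrete vocabulary; these names are hypothetical, not from the paper.
    """
    device = prompt_ids.device
    seq = torch.cat([prompt_ids, torch.full((gen_len,), mask_id, device=device)])
    gen_start = prompt_ids.numel()

    for step in range(num_steps):
        logits = model(seq.unsqueeze(0)).squeeze(0)     # (seq_len, vocab)
        conf, pred = logits.softmax(dim=-1).max(dim=-1) # per-position confidence

        still_masked = seq == mask_id
        still_masked[:gen_start] = False                # never modify the prompt

        # Unmask the most confident positions this step; leave the rest masked
        # so later steps can refine them with richer bidirectional context.
        num_to_unmask = max(1, int(still_masked.sum().item() / (num_steps - step)))
        conf = conf.masked_fill(~still_masked, float("-inf"))
        idx = conf.topk(num_to_unmask).indices
        seq[idx] = pred[idx]

    return seq[gen_start:]
```

In this kind of scheme, the same loop can in principle produce text, image, or speech tokens, since all modalities share one discrete token space; the per-step unmasking budget here is a simple linear schedule chosen for readability.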