[2603.19643] OmniDiT: Extending Diffusion Transformer to Omni-VTON Framework
Computer Science > Computer Vision and Pattern Recognition
arXiv:2603.19643 (cs) [Submitted on 20 Mar 2026]
Title: OmniDiT: Extending Diffusion Transformer to Omni-VTON Framework
Authors: Weixuan Zeng, Pengcheng Wei, Huaiqing Wang, Boheng Zhang, Jia Sun, Dewen Fan, Lin HE, Long Chen, Qianqian Gan, Fan Yang, Tingting Gao
Abstract: Despite the rapid advancement of Virtual Try-On (VTON) and Try-Off (VTOFF) technologies, existing VTON methods struggle with fine-grained detail preservation, generalization to complex scenes, complicated pipelines, and efficient inference. To tackle these problems, we propose OmniDiT, an omni Virtual Try-On framework based on the Diffusion Transformer that unifies the try-on and try-off tasks in a single model. Specifically, we first establish a self-evolving data curation pipeline to continuously produce data, and construct a large VTON dataset, Omni-TryOn, which contains over 380k diverse, high-quality garment-model-tryon image pairs with detailed text prompts. We then employ token concatenation and design an adaptive position encoding to effectively incorporate multiple reference conditions. To relieve the bottleneck of long-sequence computation, we are the first to introduce Shifted Window Attention into the diffusion model, achieving linear complexity. To remedy ...
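The linear-complexity claim for Shifted Window Attention can be illustrated with a minimal NumPy sketch (this is not the authors' implementation; the 1D token layout, half-window shift, and function names are assumptions for illustration). Attention is computed only within fixed-size windows, so cost is O(n · window_size · dim) rather than O(n² · dim); shifting the sequence by half a window between layers lets information flow across window boundaries, as in Swin Transformer:

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def window_attention(x, window_size):
    """Self-attention restricted to non-overlapping windows.

    x: (seq_len, dim) token sequence; seq_len must be divisible by window_size.
    Each window of tokens attends only to itself, so total cost is linear
    in seq_len for a fixed window_size.
    """
    n, d = x.shape
    w = x.reshape(n // window_size, window_size, d)       # (num_windows, ws, d)
    scores = w @ w.transpose(0, 2, 1) / np.sqrt(d)        # per-window attention scores
    return (softmax(scores) @ w).reshape(n, d)

def shifted_window_attention(x, window_size):
    """Same as window_attention, but tokens are cyclically shifted by half
    a window first, so successive layers mix information across window
    boundaries (the 'shifted' part of Shifted Window Attention)."""
    shift = window_size // 2
    shifted = np.roll(x, -shift, axis=0)
    out = window_attention(shifted, window_size)
    return np.roll(out, shift, axis=0)
```

Alternating plain and shifted window layers is the standard Swin-style recipe; a perturbation in one window leaves the other windows' outputs untouched in the plain layer, which is exactly what bounds the cost.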