[2603.26866] LACON: Training Text-to-Image Model from Uncurated Data

arXiv - AI March 31, 2026 3 min read

About this article

Abstract page for arXiv paper 2603.26866: LACON: Training Text-to-Image Model from Uncurated Data

Computer Science > Computer Vision and Pattern Recognition arXiv:2603.26866 (cs) [Submitted on 27 Mar 2026] Title:LACON: Training Text-to-Image Model from Uncurated Data Authors:Zhiyang Liang, Ziyu Wan, Hongyu Liu, Dong Chen, Qiu Shen, Hao Zhu, Dongdong Chen View a PDF of the paper titled LACON: Training Text-to-Image Model from Uncurated Data, by Zhiyang Liang and Ziyu Wan and Hongyu Liu and Dong Chen and Qiu Shen and Hao Zhu and Dongdong Chen View PDF HTML (experimental) Abstract:The success of modern text-to-image generation is largely attributed to massive, high-quality datasets. Currently, these datasets are curated through a filter-first paradigm that aggressively discards low-quality raw data based on the assumption that it is detrimental to model performance. Is the discarded bad data truly useless, or does it hold untapped potential? In this work, we critically re-examine this question. We propose LACON (Labeling-and-Conditioning), a novel training framework that exploits the underlying uncurated data distribution. Instead of filtering, LACON re-purposes quality signals, such as aesthetic scores and watermark probabilities as explicit, quantitative condition labels. The generative model is then trained to learn the full spectrum of data quality, from bad to good. By learning the explicit boundary between high- and low-quality content, LACON achieves superior generation quality compared to baselines trained only on filtered data using the same compute budget, provi...

Originally published on March 31, 2026. Curated by AI News.

Llms

Claude Opus 4.6 API at 40% below Anthropic pricing – try free before you pay anything

Hey everyone I've set up a self-hosted API gateway using [New-API](QuantumNous/new-ap) to manage and distribute Claude Opus 4.6 access ac...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Machine Learning

[D] ICML reviewer making up false claim in acknowledgement, what to do?

In a rebuttal acknowledgement we received, the reviewer made up a claim that our method performs worse than baselines with some hyperpara...

Reddit - Machine Learning · 1 min · about 2 hours ago

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 4 hours ago

Machine Learning

[D] Budget Machine Learning Hardware

Looking to get into machine learning and found this video on a piece of hardware for less than £500. Is it really possible to teach auton...

Reddit - Machine Learning · 1 min · about 6 hours ago

[2603.26866] LACON: Training Text-to-Image Model from Uncurated Data

About this article

Related Articles

Claude Opus 4.6 API at 40% below Anthropic pricing – try free before you pay anything

[D] ICML reviewer making up false claim in acknowledgement, what to do?

UMKC Announces New Master of Science in Artificial Intelligence

[D] Budget Machine Learning Hardware

No comments

Stay updated with AI News