[2603.00575] SWE-Hub: A Unified Production System for Scalable, Executable Software Engineering Tasks
About this article
Abstract page for arXiv paper 2603.00575: SWE-Hub: A Unified Production System for Scalable, Executable Software Engineering Tasks
Computer Science > Artificial Intelligence arXiv:2603.00575 (cs) [Submitted on 28 Feb 2026] Title:SWE-Hub: A Unified Production System for Scalable, Executable Software Engineering Tasks Authors:Yucheng Zeng, Shupeng Li, Daxiang Dong, Ruijie Xu, Zimo Chen, Liwei Zheng, Yuxuan Li, Zhe Zhou, Haotian Zhao, Lun Tian, Heng Xiao, Tianshu Zhu, Longkun Hao, Jianmin Wu View a PDF of the paper titled SWE-Hub: A Unified Production System for Scalable, Executable Software Engineering Tasks, by Yucheng Zeng and 13 other authors View PDF HTML (experimental) Abstract:Progress in software-engineering agents is increasingly constrained by the scarcity of executable, scalable, and realistic data for training and evaluation. This scarcity stems from three fundamental challenges in existing pipelines: environments are brittle and difficult to reproduce across languages; synthesizing realistic, system-level bugs at scale is computationally expensive; and existing data predominantly consists of short-horizon repairs, failing to capture long-horizon competencies like architectural consistency. We introduce \textbf{SWE-Hub}, an end-to-end system that operationalizes the data factory abstraction by unifying environment automation, scalable synthesis, and diverse task generation into a coherent production stack. At its foundation, the \textbf{Env Agent} establishes a shared execution substrate by automatically converting raw repository snapshots into reproducible, multi-language container environme...