[2603.28342] Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization
About this article
Abstract page for arXiv paper 2603.28342: Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization
Computer Science > Computation and Language arXiv:2603.28342 (cs) [Submitted on 30 Mar 2026] Title:Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization Authors:He Du, Qiming Ge, Jiakai Hu, Aijun Yang, Zheng Cai, Zixian Huang, Sheng Yuan, Qinxiu Cheng, Xinchen Xie, Yicheng Chen, Yining Li, Jiaxing Xie, Huanan Dong, Yaguang Wu, Xiangjun Huang, Jian Yang, Hui Wang, Bowen Zhou, Bowen Li, Qipeng Guo, Kai Chen View a PDF of the paper titled Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization, by He Du and 19 other authors View PDF HTML (experimental) Abstract:We present Kernel-Smith, a framework for high-performance GPU kernel and operator generation that combines a stable evaluation-driven evolutionary agent with an evolution-oriented post-training recipe. On the agent side, Kernel-Smith maintains a population of executable candidates and iteratively improves them using an archive of top-performing and diverse programs together with structured execution feedback on compilation, correctness, and speedup. To make this search reliable, we build backend-specific evaluation services for Triton on NVIDIA GPUs and Maca on MetaX GPUs. On the training side, we convert long-horizon evolution trajectories into step-centric supervision and reinforcement learning signals by retaining correctness-preserving, high-gain revisions, so that the model is optimized as a strong local improver inside the evolutionary loop rather than as a one-shot generator. Under a un...