[2604.04599] LP-GEMM: Integrating Layout Propagation into GEMM Operations

arXiv:2604.04599 (cs), Computer Science > Distributed, Parallel, and Cluster Computing
Submitted on 6 Apr 2026

Title: LP-GEMM: Integrating Layout Propagation into GEMM Operations

Authors: César Guedes Carneiro, Lucas Alvarenga, Guido Araujo, Sandro Rigo

Abstract: In Scientific Computing and modern Machine Learning (ML) workloads, sequences of dependent General Matrix Multiplications (GEMMs) often dominate execution time. While state-of-the-art BLAS libraries aggressively optimize individual GEMM calls, they remain constrained by the BLAS API, which requires each call to independently pack input matrices and restore outputs to a canonical memory layout. In sequential GEMMs, these constraints cause redundant packing and unpacking, wasting valuable computational resources. This paper introduces LP-GEMM, a decomposition of the GEMM kernel that enables packing-layout propagation across sequential GEMM operations. This approach eliminates unnecessary data repacking while preserving full BLAS semantic correctness at the boundaries. We evaluate LP-GEMM on x86 (AVX-512) and RISC-V (RVV 1.0) architectures across MLP-like and Attention-like workloads. Our results show average speedups of 2.25x over OpenBLAS on Intel x86 for sequential GEMMs and competitive gains relative to vendor-optimized libraries such as Intel MKL. ...
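
The redundancy the abstract describes can be illustrated with a toy model. The sketch below is not the paper's kernel: the panel height MR, the panel layout, and every function name are assumptions made for illustration, and only the left operand is packed (production BLAS kernels pack both operands into cache-sized blocks). It compares a chain of two dependent GEMMs, (A·B)·C, computed with self-contained BLAS-style calls against one where the intermediate result stays in the packed layout.

```python
# Toy model of packing in chained GEMMs. Hypothetical names throughout;
# this is NOT the LP-GEMM implementation, only an illustration of the idea.

MR = 2  # assumed panel height (rows per packed panel)
stats = {"packs": 0, "unpacks": 0}  # counts layout conversions


def pack_row_panels(M):
    """Copy a row-major matrix into panels of up to MR rows.

    Each panel is a list of columns; panel[k][r] holds M[i0 + r][k].
    """
    stats["packs"] += 1
    rows, cols = len(M), len(M[0])
    panels = []
    for i0 in range(0, rows, MR):
        h = min(MR, rows - i0)
        panels.append([[M[i0 + r][k] for r in range(h)] for k in range(cols)])
    return panels


def unpack_row_panels(panels):
    """Restore the canonical row-major layout from packed panels."""
    stats["unpacks"] += 1
    out = []
    for panel in panels:
        h = len(panel[0])
        for r in range(h):
            out.append([col[r] for col in panel])
    return out


def gemm_on_panels(a_panels, B):
    """Multiply packed A by row-major B, writing the result in the SAME
    panel layout so that a following GEMM can consume it directly."""
    k_dim, n = len(B), len(B[0])
    out_panels = []
    for panel in a_panels:
        h = len(panel[0])
        out = [[0.0] * h for _ in range(n)]
        for k in range(k_dim):
            for j in range(n):
                b = B[k][j]
                for r in range(h):
                    out[j][r] += panel[k][r] * b
        out_panels.append(out)
    return out_panels


def gemm_blas_style(A, B):
    """Self-contained call: pack input, compute, restore canonical output."""
    return unpack_row_panels(gemm_on_panels(pack_row_panels(A), B))


def chain_blas_style(A, B, C):
    """Two independent calls: the intermediate is unpacked, then repacked."""
    return gemm_blas_style(gemm_blas_style(A, B), C)


def chain_propagated(A, B, C):
    """Pack once at entry, keep the intermediate packed, unpack at exit."""
    packed = pack_row_panels(A)
    packed = gemm_on_panels(packed, B)
    packed = gemm_on_panels(packed, C)
    return unpack_row_panels(packed)
```

In this model both chains return the same row-major result, but the BLAS-style chain converts layouts twice in each direction while the propagated chain does so once, with canonical layout restored only at the boundary of the sequence.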

Originally published on April 07, 2026. Curated by AI News.
