[2601.11471] Low-Rank Key Value Attention

arXiv:2601.11471 [cs.LG]
Submitted on 16 Jan 2026 (v1), last revised 7 Apr 2026 (this version, v3)

Title: Low-Rank Key Value Attention
Authors: James O'Neill, Robert Clancy, Mariia Matskevichus, Fergal Reid

Abstract: The key-value (KV) cache is a primary memory bottleneck in Transformers. We propose Low-Rank Key-Value (LRKV) attention, which reduces KV cache memory by exploiting redundancy across attention heads, while being compute efficient. Each layer uses a shared full-rank KV projection augmented with low-rank, head-specific residuals, providing a continuous trade-off between complete sharing and full independence. After pretraining models of size 128M to 6.3B parameters, LRKV consistently achieves the lowest test loss among standard MHA, MQA/GQA, and MLA while using only 45-53% of MHA's KV cache. LRKV reaches equivalent baseline quality 18-25% faster (measured in training steps). After supervised midtraining, LRKV achieves the highest downstream task performance across ARC-Easy, ARC-Challenge, MMLU, GSM8K, and HumanEval benchmarks.

Subjects: Machine Learning (cs.LG)
Cite as: arXiv:2601.11471 [cs.LG] (or arXiv:2601.11471v3 [cs.LG] for this version)
DOI: https://doi.org/10.48550/arXiv.2601.11471 (arXiv-issued DOI via DataCite)
Submission history: From James O'Neill. [v1] Fri, 16 Jan...
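
The abstract describes each layer as a shared full-rank KV projection plus low-rank, head-specific residuals. The following is a minimal sketch of how such a layout could shrink the cache, not the authors' implementation. It assumes (this is not stated in the abstract) that the cache stores one shared key/value vector per token plus a rank-r latent per head, and that each head's key is rebuilt as the shared key plus an up-projected residual. The function names, the `rank` parameter, and all sizes are hypothetical.

```python
# Illustrative sketch only: the cache layout and all names below are assumptions,
# not taken from the paper.

def kv_cache_floats_per_token(n_heads, d_head, scheme, n_kv_groups=1, rank=0):
    """Number of floats cached per token per layer (keys plus values)."""
    if scheme == "mha":
        per_side = n_heads * d_head          # every head keeps its own K and V
    elif scheme == "mqa":
        per_side = d_head                    # one K/V shared by all heads
    elif scheme == "gqa":
        per_side = n_kv_groups * d_head      # one K/V per group of heads
    elif scheme == "lrkv":
        # Assumed layout: one shared full-rank vector plus a rank-r latent per head.
        per_side = d_head + n_heads * rank
    else:
        raise ValueError(f"unknown scheme: {scheme}")
    return 2 * per_side                      # factor 2 covers keys and values


def reconstruct_head_key(k_shared, z_h, B_h):
    """Assumed reconstruction: k_h = k_shared + z_h @ B_h.

    k_shared: list of d_head floats (shared key for one token)
    z_h:      list of rank floats (cached low-rank latent for head h)
    B_h:      rank x d_head nested list (hypothetical head-specific up-projection)
    """
    return [
        k_shared[j] + sum(z_h[i] * B_h[i][j] for i in range(len(z_h)))
        for j in range(len(k_shared))
    ]


if __name__ == "__main__":
    mha = kv_cache_floats_per_token(16, 64, "mha")
    lrkv = kv_cache_floats_per_token(16, 64, "lrkv", rank=28)
    print(mha, lrkv, lrkv / mha)
    print(reconstruct_head_key([1.0, 2.0], [1.0], [[0.5, -1.0]]))
```

Under this assumed layout the cache ratio relative to MHA is 1/n_heads + rank/d_head, so rank 0 collapses to complete sharing (MQA) and larger ranks move toward independent heads, which is one way to read the "continuous trade-off" in the abstract. The sizes in the usage lines (16 heads, head dimension 64, rank 28) are chosen for illustration and are not the paper's configurations.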

Originally published on April 09, 2026. Curated by AI News.
