[2602.00079] Embedding Compression via Spherical Coordinates

arXiv - Machine Learning 3 min read

About this article

Abstract page for arXiv paper 2602.00079: Embedding Compression via Spherical Coordinates

Computer Science > Machine Learning
arXiv:2602.00079 (cs)
[Submitted on 22 Jan 2026 (v1), last revised 25 Mar 2026 (this version, v4)]

Title: Embedding Compression via Spherical Coordinates
Authors: Han Xiao

Abstract: We present an $\epsilon$-bounded compression method for unit-norm embeddings that achieves 1.5$\times$ compression, 25% better than the best prior lossless method. The method exploits that spherical coordinates of high-dimensional unit vectors concentrate around $\pi/2$, causing IEEE 754 exponents to collapse to a single value and high-order mantissa bits to become predictable, enabling entropy coding of both. Reconstruction error is bounded by float32 machine epsilon ($1.19 \times 10^{-7}$), making reconstructed values indistinguishable from originals at float32 precision. Evaluation across 26 configurations spanning text, image, and multi-vector embeddings confirms consistent compression improvement with zero measurable retrieval degradation on BEIR benchmarks.

Subjects: Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
MSC classes: 68T50
ACM classes: I.2.7
Cite as: arXiv:2602.00079 [cs.LG] (or arXiv:2602.00079v4 [cs.LG] for this version)
DOI: https://doi.org/10.48550/arXiv.2602.00079 (arXiv-issued DOI via DataCite)

Submission history
From: Han Xiao
[v1] Thu, 22 Jan 2026 03:21:0...
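
The concentration effect named in the abstract can be checked with a small experiment. The sketch below is not the paper's codec: it does no entropy coding, and the function names, the 256-dimensional random test vector, and the particular hyperspherical angle convention are all assumptions made for illustration. It converts a random unit vector to spherical angles, reads the float32 exponent field of each angle, and compares how often the most common exponent occurs for the angles versus the raw Cartesian coordinates.

```python
import math
import random
import struct
from collections import Counter


def random_unit_vector(dim, rng):
    """Sample a point uniformly on the unit sphere via normalized Gaussians."""
    v = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def to_spherical(x):
    """Return the dim-1 hyperspherical angles of a unit vector (assumed convention)."""
    d = len(x)
    # tail[k] = sqrt(x[k]^2 + ... + x[d-1]^2), accumulated from the end.
    tail = [0.0] * (d + 1)
    acc = 0.0
    for k in range(d - 1, -1, -1):
        acc += x[k] * x[k]
        tail[k] = math.sqrt(acc)
    # The first d-2 angles lie in [0, pi]; the last one lies in (-pi, pi].
    angles = [math.atan2(tail[k + 1], x[k]) for k in range(d - 2)]
    angles.append(math.atan2(x[d - 1], x[d - 2]))
    return angles


def from_spherical(angles):
    """Invert to_spherical: rebuild the unit vector from its angles."""
    x = []
    sin_prod = 1.0
    for theta in angles:
        x.append(sin_prod * math.cos(theta))
        sin_prod *= math.sin(theta)
    x.append(sin_prod)
    return x


def float32_exponent(value):
    """Biased 8-bit exponent field of value after rounding to IEEE 754 binary32."""
    bits = struct.unpack("<I", struct.pack("<f", value))[0]
    return (bits >> 23) & 0xFF


rng = random.Random(0)
vec = random_unit_vector(256, rng)
angles = to_spherical(vec)

angle_exponents = Counter(float32_exponent(a) for a in angles)
coord_exponents = Counter(float32_exponent(c) for c in vec)

top_exp, top_count = angle_exponents.most_common(1)[0]
share = top_count / len(angles)
coord_share = coord_exponents.most_common(1)[0][1] / len(vec)

print(f"mean angle: {sum(angles) / len(angles):.4f} (pi/2 = {math.pi / 2:.4f})")
print(f"most common angle exponent: {top_exp}, share {share:.1%}")
print(f"most common coordinate exponent share: {coord_share:.1%}")
```

Any angle in [1, 2) has biased exponent 127, and $\pi/2 \approx 1.571$ sits inside that interval, which is why angles clustered near $\pi/2$ share one exponent value. The last few angles depend on only a handful of coordinates and spread more widely, so the share is high but not 100% in this toy setup.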

Originally published on March 27, 2026. Curated by AI News.

