[2510.07746] t-SNE Exaggerates Clusters, Provably

[2510.07746] t-SNE Exaggerates Clusters, Provably

arXiv - Machine Learning 3 min read

About this article

Abstract page for arXiv paper 2510.07746: t-SNE Exaggerates Clusters, Provably

Computer Science > Machine Learning arXiv:2510.07746 (cs) [Submitted on 9 Oct 2025 (v1), last revised 2 Mar 2026 (this version, v2)] Title:t-SNE Exaggerates Clusters, Provably Authors:Noah Bergam, Szymon Snoeck, Nakul Verma View a PDF of the paper titled t-SNE Exaggerates Clusters, Provably, by Noah Bergam and 2 other authors View PDF HTML (experimental) Abstract:Central to the widespread use of t-distributed stochastic neighbor embedding (t-SNE) is the conviction that it produces visualizations whose structure roughly matches that of the input. To the contrary, we prove that (1) the strength of the input clustering, and (2) the extremity of outlier points, cannot be reliably inferred from the t-SNE output. We demonstrate the prevalence of these failure modes in practice as well. Comments: Subjects: Machine Learning (cs.LG) Cite as: arXiv:2510.07746 [cs.LG]   (or arXiv:2510.07746v2 [cs.LG] for this version)   https://doi.org/10.48550/arXiv.2510.07746 Focus to learn more arXiv-issued DOI via DataCite Submission history From: Noah Bergam [view email] [v1] Thu, 9 Oct 2025 03:34:36 UTC (4,411 KB) [v2] Mon, 2 Mar 2026 05:39:53 UTC (5,452 KB) Full-text links: Access Paper: View a PDF of the paper titled t-SNE Exaggerates Clusters, Provably, by Noah Bergam and 2 other authorsView PDFHTML (experimental)TeX Source view license Current browse context: cs.LG < prev   |   next > new | recent | 2025-10 Change to browse by: cs References & Citations NASA ADSGoogle Scholar Semantic Schol...

Originally published on March 03, 2026. Curated by AI News.

Related Articles

[2601.13227] Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?
Llms

[2601.13227] Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?

Abstract page for arXiv paper 2601.13227: Insider Knowledge: How Much Can RAG Systems Gain from Evaluation Secrets?

arXiv - AI · 3 min ·
[2601.22440] AI and My Values: User Perceptions of LLMs' Ability to Extract, Embody, and Explain Human Values from Casual Conversations
Llms

[2601.22440] AI and My Values: User Perceptions of LLMs' Ability to Extract, Embody, and Explain Human Values from Casual Conversations

Abstract page for arXiv paper 2601.22440: AI and My Values: User Perceptions of LLMs' Ability to Extract, Embody, and Explain Human Value...

arXiv - AI · 4 min ·
[2601.13222] Incorporating Q&A Nuggets into Retrieval-Augmented Generation
Nlp

[2601.13222] Incorporating Q&A Nuggets into Retrieval-Augmented Generation

Abstract page for arXiv paper 2601.13222: Incorporating Q&A Nuggets into Retrieval-Augmented Generation

arXiv - AI · 3 min ·
[2512.01707] StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos
Llms

[2512.01707] StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos

Abstract page for arXiv paper 2512.01707: StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos

arXiv - AI · 4 min ·
More in Nlp: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime