[2603.18413] Statistical Testing Framework for Clustering Pipelines by Selective Inference

arXiv:2603.18413 (stat), Statistics > Machine Learning
Submitted on 19 Mar 2026 (v1), last revised 23 Mar 2026 (this version, v2)

Title: Statistical Testing Framework for Clustering Pipelines by Selective Inference

Authors: Yugo Miyata, Tomohiro Shiraishi, Shunichi Nishino, Ichiro Takeuchi

Abstract: A data analysis pipeline is a structured sequence of steps that transforms raw data into meaningful insights by integrating multiple analysis algorithms. In many practical applications, analytical findings are obtained only after data pass through several data-dependent procedures within such pipelines. In this study, we address the problem of quantifying the statistical reliability of results produced by data analysis pipelines. As a proof of concept, we focus on clustering pipelines that identify cluster structures from complex and heterogeneous data through procedures such as outlier detection, feature selection, and clustering. We propose a novel statistical testing framework to assess the significance of clustering results obtained through these pipelines. Our framework, based on selective inference, enables the systematic construction of valid statistical tests for clustering pipelines composed of predefined components. We prove that the proposed test controls the type I error rate at any nominal level ...
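The abstract is cut off above and does not show the authors' test itself. As a rough illustration of the problem it describes, the sketch below simulates a toy pipeline on pure-noise data and applies a naive z-test that ignores the data-dependent steps. Everything in it is an assumption made for illustration and is not taken from the paper: the function names, the outlier rule, the variance-based feature selection, and the median split used as a stand-in for clustering.

```python
import math
import numpy as np

def naive_pipeline_pvalue(X):
    """Toy pipeline (hypothetical): drop outliers, pick one feature, split into
    two clusters, then run a naive z-test on the cluster means."""
    # Step 1 (outlier detection): drop the 2 rows farthest from the coordinate-wise median.
    dist = np.linalg.norm(X - np.median(X, axis=0), axis=1)
    X = X[np.argsort(dist)[:-2]]
    # Step 2 (feature selection): keep the feature with the largest sample variance.
    x = X[:, int(np.argmax(X.var(axis=0)))]
    # Step 3 (clustering): split at the median, a simple stand-in for 2-means in 1D.
    upper = x > np.median(x)
    a, b = x[upper], x[~upper]
    # Naive two-sample z-test assuming unit variance; it ignores that the
    # rows, the feature, and the groups were all chosen by looking at the data.
    z = (a.mean() - b.mean()) / math.sqrt(1.0 / len(a) + 1.0 / len(b))
    return math.erfc(abs(z) / math.sqrt(2.0))  # two-sided p-value

def naive_false_positive_rate(n_trials=500, n=30, d=5, alpha=0.05, seed=0):
    """Fraction of pure-noise datasets on which the naive test rejects at level alpha."""
    rng = np.random.default_rng(seed)
    rejections = sum(
        naive_pipeline_pvalue(rng.standard_normal((n, d))) < alpha
        for _ in range(n_trials)
    )
    return rejections / n_trials

if __name__ == "__main__":
    print(naive_false_positive_rate())
```

Because the data contain no cluster structure, a valid test at level 0.05 should reject in about 5% of trials. The naive test rejects far more often, since the two groups were constructed to differ. This is the kind of gap that selective inference is intended to close by conditioning on the pipeline's data-dependent choices.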

Originally published on March 24, 2026. Curated by AI News.
