[2603.18413] Statistical Testing Framework for Clustering Pipelines by Selective Inference

arXiv - Machine Learning March 24, 2026 3 min read

About this article

Abstract page for arXiv paper 2603.18413: Statistical Testing Framework for Clustering Pipelines by Selective Inference

Statistics > Machine Learning

arXiv:2603.18413 (stat)

[Submitted on 19 Mar 2026 (v1), last revised 23 Mar 2026 (this version, v2)]

Title: Statistical Testing Framework for Clustering Pipelines by Selective Inference

Authors: Yugo Miyata, Tomohiro Shiraishi, Shunichi Nishino, Ichiro Takeuchi

Abstract: A data analysis pipeline is a structured sequence of steps that transforms raw data into meaningful insights by integrating multiple analysis algorithms. In many practical applications, analytical findings are obtained only after data pass through several data-dependent procedures within such pipelines. In this study, we address the problem of quantifying the statistical reliability of results produced by data analysis pipelines. As a proof of concept, we focus on clustering pipelines that identify cluster structures from complex and heterogeneous data through procedures such as outlier detection, feature selection, and clustering. We propose a novel statistical testing framework to assess the significance of clustering results obtained through these pipelines. Our framework, based on selective inference, enables the systematic construction of valid statistical tests for clustering pipelines composed of predefined components. We prove that the proposed test controls the type I error rate at any nominal level ...
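To make the problem in the abstract concrete, the following is a minimal sketch of why a test that ignores the pipeline is unreliable. It is not the authors' method and does not implement selective inference. It only simulates a toy pipeline (outlier removal, feature selection, clustering) on pure noise and applies a naive two-sample z-test to the clusters it finds. All function names, the MAD outlier rule, the variance-based feature selection, and the parameter values are assumptions made for illustration.

```python
import math
import random
from statistics import NormalDist, mean, median


def remove_outliers(rows, thresh=3.5):
    """Hypothetical outlier step: drop rows whose norm is far from the median norm."""
    norms = [math.sqrt(sum(v * v for v in r)) for r in rows]
    med = median(norms)
    mad = median([abs(n - med) for n in norms]) or 1e-12
    return [r for r, n in zip(rows, norms)
            if abs(n - med) / (1.4826 * mad) <= thresh]


def select_feature(rows):
    """Hypothetical feature-selection step: index of the highest-variance column."""
    def variance(j):
        col = [r[j] for r in rows]
        m = mean(col)
        return sum((x - m) ** 2 for x in col) / len(col)
    return max(range(len(rows[0])), key=variance)


def two_means_1d(xs, iters=20):
    """Lloyd's algorithm with two centers in one dimension."""
    c0, c1 = min(xs), max(xs)
    labels = []
    for _ in range(iters):
        labels = [0 if abs(x - c0) <= abs(x - c1) else 1 for x in xs]
        c0 = mean(x for x, l in zip(xs, labels) if l == 0)
        c1 = mean(x for x, l in zip(xs, labels) if l == 1)
    return labels


def naive_p_value(xs, labels, sigma=1.0):
    """Two-sided z-test that wrongly treats the clusters as fixed in advance."""
    g0 = [x for x, l in zip(xs, labels) if l == 0]
    g1 = [x for x, l in zip(xs, labels) if l == 1]
    se = sigma * math.sqrt(1 / len(g0) + 1 / len(g1))
    z = (mean(g0) - mean(g1)) / se
    return 2 * (1 - NormalDist().cdf(abs(z)))


def naive_false_positive_rate(trials=200, n=30, d=5, alpha=0.05, seed=0):
    """Fraction of pure-noise datasets on which the naive test rejects."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(trials):
        # Null data: independent standard normal noise, so no true clusters exist.
        rows = [[rng.gauss(0, 1) for _ in range(d)] for _ in range(n)]
        kept = remove_outliers(rows)
        j = select_feature(kept)
        xs = [r[j] for r in kept]
        labels = two_means_1d(xs)
        if naive_p_value(xs, labels) < alpha:
            rejections += 1
    return rejections / trials


if __name__ == "__main__":
    print("naive rejection rate on null data:", naive_false_positive_rate())
```

Because the clusters are chosen to be as separated as possible on the same data that is then tested, the naive rejection rate on noise sits far above the nominal level of 0.05. A test that conditions on the choices made by each pipeline step, as the abstract describes, is meant to remove this inflation.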

Originally published on March 24, 2026. Curated by AI News.
