[2206.02088] LOCO Feature Importance Inference without Data Splitting via Minipatch Ensembles
Statistics > Machine Learning
arXiv:2206.02088 (stat)
[Submitted on 5 Jun 2022 (v1), last revised 23 Mar 2026 (this version, v3)]

Title: LOCO Feature Importance Inference without Data Splitting via Minipatch Ensembles
Authors: Luqin Gan, Lili Zheng, Genevera I. Allen

Abstract: Feature importance inference is critical for the interpretability and reliability of machine learning models. There has been increasing interest in developing model-agnostic approaches to interpret any predictive model, often in the form of feature occlusion or leave-one-covariate-out (LOCO) inference. Existing methods typically make limiting distributional or modeling assumptions and require data splitting. In this work, we develop a novel, mostly model-agnostic, and distribution-free inference framework for feature importance in regression or classification tasks that does not require data splitting. Our approach leverages a form of random observation and feature subsampling called minipatch ensembles; it uses the trained ensembles for inference and requires no model refitting or held-out test data after training. We show that our approach enjoys both computational and statistical efficiency and circumvents the interpretational challenges of data splitting. Further, despite using the same data for training and inference, we show ...
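
To make the minipatch LOCO idea concrete, below is a minimal sketch in Python (using numpy and scikit-learn, which the paper does not prescribe). Names and settings such as ensemble_pred, K, m, and l are illustrative assumptions, and the normal-approximation interval at the end merely stands in for the paper's formal inference theory; treat this as a sketch of the mechanism, not the authors' implementation.

import numpy as np
from sklearn.tree import DecisionTreeRegressor

rng = np.random.default_rng(0)

# Toy regression data: only the first two features matter.
n, p = 200, 10
X = rng.standard_normal((n, p))
y = 2.0 * X[:, 0] - 1.5 * X[:, 1] + 0.5 * rng.standard_normal(n)

# Train an ensemble of K minipatches: each base learner sees a random
# subset of m observations and l features (values here are assumptions).
K, m, l = 500, 40, 4
patches = []  # (model, training-row set, feature-column array)
for _ in range(K):
    rows = rng.choice(n, size=m, replace=False)
    cols = rng.choice(p, size=l, replace=False)
    tree = DecisionTreeRegressor(max_depth=3).fit(X[rows][:, cols], y[rows])
    patches.append((tree, set(rows), cols))

def ensemble_pred(i, exclude_feature=None):
    # Average over minipatches that did NOT train on observation i
    # (leave-one-observation-out by construction) and, optionally,
    # did not use the given feature (leave-one-covariate-out).
    preds = [tree.predict(X[i:i + 1, cols])[0]
             for tree, rows, cols in patches
             if i not in rows
             and (exclude_feature is None or exclude_feature not in cols)]
    return np.mean(preds)

# Per-observation LOCO importance for feature j:
# Delta_j(i) = |y_i - mu_{-i,-j}(x_i)| - |y_i - mu_{-i}(x_i)|.
j = 0
deltas = np.array([abs(y[i] - ensemble_pred(i, exclude_feature=j))
                   - abs(y[i] - ensemble_pred(i)) for i in range(n)])

# Crude normal-approximation interval; the paper derives valid
# asymptotics for this kind of statistic, which this line does not replicate.
mean, se = deltas.mean(), deltas.std(ddof=1) / np.sqrt(n)
print(f"feature {j}: LOCO importance {mean:.3f}, approx 95% CI +/- {1.96 * se:.3f}")

The key design point the sketch illustrates is why no data splitting is needed: because each observation is left out of most minipatches at random, its held-out prediction can be formed from the subset of already-trained minipatches that never saw it, so training and inference reuse the same data without model refitting.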