[2604.05225] fastml: Guarded Resampling Workflows for Safer Automated

[2604.05225] fastml: Guarded Resampling Workflows for Safer Automated Machine Learning in R

arXiv - Machine Learning April 08, 2026 3 min read

About this article

Abstract page for arXiv paper 2604.05225: fastml: Guarded Resampling Workflows for Safer Automated Machine Learning in R

Statistics > Computation arXiv:2604.05225 (stat) [Submitted on 6 Apr 2026] Title:fastml: Guarded Resampling Workflows for Safer Automated Machine Learning in R Authors:Selcuk Korkmaz, Dincer Goksuluk, Eda Karaismailoglu View a PDF of the paper titled fastml: Guarded Resampling Workflows for Safer Automated Machine Learning in R, by Selcuk Korkmaz and 2 other authors View PDF HTML (experimental) Abstract:Preprocessing leakage arises when scaling, imputation, or other data-dependent transformations are estimated before resampling, inflating apparent performance while remaining hard to detect. We present fastml, an R package that provides a single-call interface for leakage-aware machine learning through guarded resampling, where preprocessing is re-estimated inside each resample and applied to the corresponding assessment data. The package supports grouped and time-ordered resampling, blocks high-risk configurations, audits recipes for external dependencies, and includes sandboxed execution and integrated model explanation. We evaluate fastml with a Monte Carlo simulation contrasting global and fold-local normalization, a usability comparison with tidymodels under matched specifications, and survival benchmarks across datasets of different sizes. The simulation demonstrates that global preprocessing substantially inflates apparent performance relative to guarded resampling. fastml matched held-out performance obtained with tidymodels while reducing workflow orchestration, an...

Originally published on April 08, 2026. Curated by AI News.

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 1 hour ago

Machine Learning

Accelerating science with AI and simulations

MIT Professor Rafael Gómez-Bombarelli discusses the transformative potential of AI in scientific research, emphasizing its role in materi...

AI News - General · 10 min · about 1 hour ago

Machine Learning

Weird ICML decision [D]

Hello, A friend of mine had a paper with borderline scores accepted at ICML. However, the comment made by the meta reviewers feels like t...

Reddit - Machine Learning · 1 min · about 1 hour ago

Machine Learning

[2603.13566] EmDT: Embedding Diffusion Transformer for Tabular Data Generation in Fraud Detection

Abstract page for arXiv paper 2603.13566: EmDT: Embedding Diffusion Transformer for Tabular Data Generation in Fraud Detection

arXiv - Machine Learning · 3 min · about 3 hours ago

[2604.05225] fastml: Guarded Resampling Workflows for Safer Automated Machine Learning in R

About this article

Related Articles

UMKC Announces New Master of Science in Artificial Intelligence

Accelerating science with AI and simulations

Weird ICML decision [D]

[2603.13566] EmDT: Embedding Diffusion Transformer for Tabular Data Generation in Fraud Detection

No comments

Stay updated with AI News