[2602.21478] Efficient Inference after Directionally Stable Adaptive Experiments
Summary
This paper develops efficient inference methods for adaptive experiments, introducing directional stability, a target-specific condition under which estimators computed from adaptively collected data (e.g., by bandit algorithms) remain asymptotically normal and semiparametrically efficient.
Why It Matters
The research addresses a gap in statistical methodology by giving conditions under which adaptively collected data still yields efficient estimators. This matters for fields that run adaptive experiments, such as machine learning and statistics, where valid inference after data-dependent sampling is essential for decision-making.
Key Takeaways
- Introduces directional stability, a weaker condition than previously used stability criteria.
- Demonstrates that estimators that would be efficient under i.i.d. sampling remain asymptotically normal and efficient under adaptive data collection satisfying directional stability.
- Provides the first semiparametric efficiency guarantee for LinUCB sampling.
- Establishes a convolution theorem for characterizing efficiency in adaptive data settings.
- Highlights the martingale form of the canonical gradient for high-dimensional asymptotic normality.
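To make the LinUCB takeaway concrete, here is a minimal sketch of LinUCB sampling on synthetic data. This is an illustration of the sampling scheme the paper analyzes, not the paper's code; the linear-Gaussian reward model and the names `theta`, `alpha`, `d`, `K`, `T` are all assumptions chosen for the example.

```python
import numpy as np

rng = np.random.default_rng(0)
d, K, T = 3, 2, 500                    # context dim, number of arms, horizon (illustrative)
theta = rng.normal(size=(K, d))        # true per-arm parameters (synthetic, unknown to the learner)

A = np.stack([np.eye(d)] * K)          # per-arm ridge Gram matrices (lambda = 1)
b = np.zeros((K, d))                   # per-arm sums of reward-weighted contexts
alpha = 1.0                            # UCB exploration width

log = []                               # adaptively collected trajectory: (context, arm, reward)
for _ in range(T):
    x = rng.normal(size=d)             # fresh context
    ucb = np.empty(K)
    for k in range(K):
        Ainv = np.linalg.inv(A[k])
        # ridge point estimate plus confidence width: x'mu_k + alpha * sqrt(x' A_k^{-1} x)
        ucb[k] = x @ (Ainv @ b[k]) + alpha * np.sqrt(x @ Ainv @ x)
    a = int(np.argmax(ucb))            # deterministic, history-dependent arm choice
    r = float(x @ theta[a] + rng.normal())
    A[a] += np.outer(x, x)
    b[a] += r * x
    log.append((x, a, r))
```

Because the arm choice is a deterministic function of the history, trajectories collected this way are not i.i.d., which is exactly why the paper's directional-stability condition is needed to recover standard efficiency theory.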
Statistics > Machine Learning
arXiv:2602.21478 (stat) [Submitted on 25 Feb 2026]
Title: Efficient Inference after Directionally Stable Adaptive Experiments
Authors: Zikai Shen, Houssam Zenati, Nathan Kallus, Arthur Gretton, Koulik Khamaru, Aurélien Bibaut
Abstract: We study inference on scalar-valued pathwise differentiable targets after adaptive data collection, such as a bandit algorithm. We introduce a novel target-specific condition, directional stability, which is strictly weaker than previously imposed target-agnostic stability conditions. Under directional stability, we show that estimators that would have been efficient under i.i.d. data remain asymptotically normal and semiparametrically efficient when computed from adaptively collected trajectories. The canonical gradient has a martingale form, and directional stability guarantees stabilization of its predictable quadratic variation, enabling high-dimensional asymptotic normality. We characterize efficiency using a convolution theorem for the adaptive-data setting, and give a condition under which the one-step estimator attains the efficiency bound. We verify directional stability for LinUCB, yielding the first semiparametric efficiency guarantee for a regular scalar target under LinUCB sampling.
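As one illustration of the kind of estimator the abstract describes, here is a hedged sketch of a sequential one-step (AIPW-style) estimator whose centered summands form a martingale difference sequence. To keep sampling probabilities bounded away from zero, the example collects data with an epsilon-greedy bandit rather than LinUCB; the constants `eps`, `mu`, and `T` are illustrative assumptions, and this is not the paper's construction.

```python
import numpy as np

rng = np.random.default_rng(1)
T, eps = 2000, 0.1
mu = np.array([0.3, 0.7])              # true arm means (synthetic)

# Epsilon-greedy collection; the sampling probability of the chosen arm is logged.
n = np.zeros(2); s = np.zeros(2)
data = []
for _ in range(T):
    means = np.where(n > 0, s / np.maximum(n, 1), 0.0)
    greedy = int(np.argmax(means))
    p = np.full(2, eps / 2); p[greedy] += 1 - eps
    a = int(rng.choice(2, p=p))
    r = float(rng.normal(mu[a], 1.0))
    n[a] += 1; s[a] += r
    data.append((a, r, p[a]))

# Sequential one-step estimator of mu[0]:
#   psi_t = m_t + 1{A_t = 0} / p_t * (R_t - m_t),
# where m_t is a running plug-in estimate of arm 0's mean built only from
# data strictly before time t, so psi_t - mu[0] is a martingale difference.
psi = []
m, cnt = 0.0, 0
for a, r, p in data:
    psi.append(m + (a == 0) / p * (r - m))
    if a == 0:
        cnt += 1
        m += (r - m) / cnt
psi = np.array(psi)
est = psi.mean()
se = psi.std(ddof=1) / np.sqrt(T)      # martingale plug-in standard error
```

The plug-in standard error in the last line is the sample analogue of the predictable quadratic variation whose stabilization, per the abstract, is what directional stability guarantees.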