[2603.12567] Foundation-Model Surrogates Enable Data-Efficient Active Learning for Materials Discovery

arXiv - Machine Learning March 25, 2026 4 min read

About this article

Abstract page for arXiv paper 2603.12567: Foundation-Model Surrogates Enable Data-Efficient Active Learning for Materials Discovery

Condensed Matter > Materials Science

arXiv:2603.12567 (cond-mat)

[Submitted on 13 Mar 2026 (v1), last revised 24 Mar 2026 (this version, v3)]

Title: Foundation-Model Surrogates Enable Data-Efficient Active Learning for Materials Discovery

Authors: Jeffrey Hu, Rongzhi Dong, Ying Feng, Ming Hu, Jianjun Hu

Abstract: Active learning (AL) has emerged as a powerful paradigm for accelerating materials discovery by iteratively steering experiments toward promising candidates, reducing the number of costly synthesis-and-characterization cycles needed to identify optimal materials. However, current AL relies predominantly on Gaussian Process (GP) and Random Forest (RF) surrogates, which suffer from complementary limitations: GP underfits complex composition-property landscapes due to rigid kernel assumptions, while RF produces unreliable heuristic uncertainty estimates in small-data regimes. This small-data challenge is pervasive in materials science, making reliable surrogate modeling extremely difficult with models trained from scratch on each new dataset. Here we propose In-Context Active Learning (ICAL), which addresses this bottleneck by replacing conventional surrogates with TabPFN, a transformer-based foundation model (FM) pre-trained on millions of synthetic regression tasks to meta-learn a unive...
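For readers unfamiliar with the loop the abstract describes, the following is a minimal sketch of pool-based active learning, not the paper's implementation. It assumes a toy surrogate (a bootstrap ensemble of 1-nearest-neighbour regressors, whose spread serves as a heuristic uncertainty, loosely analogous to how tree ensembles estimate uncertainty) and an upper-confidence-bound acquisition rule; the abstract excerpt does not name an acquisition function, so that choice is an assumption. All names (`ensemble_predict`, `active_learning`, `kappa`, the toy oracle) are hypothetical. In ICAL, the surrogate step would instead be handled by TabPFN.

```python
import random
import statistics
from typing import Callable, List, Sequence, Tuple

Point = Tuple[float, ...]


def ensemble_predict(
    train_x: Sequence[Point],
    train_y: Sequence[float],
    query: Point,
    n_members: int = 20,
    rng: random.Random = None,
) -> Tuple[float, float]:
    """Toy surrogate (assumption): bootstrap ensemble of 1-NN regressors.

    Returns the mean prediction and the spread across ensemble members,
    which acts as a heuristic uncertainty estimate.
    """
    rng = rng or random.Random(0)
    n = len(train_x)
    preds = []
    for _ in range(n_members):
        # Each member sees a bootstrap resample of the labeled data.
        idx = [rng.randrange(n) for _ in range(n)]
        nearest = min(
            idx,
            key=lambda i: sum((a - b) ** 2 for a, b in zip(train_x[i], query)),
        )
        preds.append(train_y[nearest])
    return statistics.fmean(preds), statistics.pstdev(preds)


def active_learning(
    pool: Sequence[Point],
    oracle: Callable[[Point], float],
    n_init: int = 3,
    n_rounds: int = 10,
    kappa: float = 1.0,
    seed: int = 0,
) -> Tuple[List[Point], List[float]]:
    """Iteratively label the candidate with the highest mean + kappa * std."""
    rng = random.Random(seed)
    unlabeled = list(pool)
    rng.shuffle(unlabeled)

    # Start from a small random set of labeled candidates.
    labeled_x = [unlabeled.pop() for _ in range(n_init)]
    labeled_y = [oracle(x) for x in labeled_x]

    for _ in range(n_rounds):
        if not unlabeled:
            break

        def ucb(x: Point) -> float:
            mean, std = ensemble_predict(labeled_x, labeled_y, x, rng=rng)
            return mean + kappa * std

        # "Run the experiment" on the most promising candidate.
        best = max(unlabeled, key=ucb)
        unlabeled.remove(best)
        labeled_x.append(best)
        labeled_y.append(oracle(best))

    return labeled_x, labeled_y


if __name__ == "__main__":
    # Hypothetical 1-D "composition" pool and a synthetic property to maximize.
    demo_pool = [(i / 49,) for i in range(50)]
    xs, ys = active_learning(demo_pool, lambda x: -(x[0] - 0.7) ** 2)
    print("best candidate found:", xs[ys.index(max(ys))])
```

Swapping the surrogate only requires replacing `ensemble_predict` with any function that returns a mean and an uncertainty for a query point, which is the slot where a GP, an RF, or a foundation model would sit.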

Originally published on March 25, 2026. Curated by AI News.

