[2510.18900] Foundation Models for Discovery and Exploration in Chemical Space

arXiv - Machine Learning May 04, 2026 4 min read

About this article

Abstract page for arXiv paper 2510.18900: Foundation Models for Discovery and Exploration in Chemical Space

Physics > Chemical Physics

arXiv:2510.18900 (physics)

[Submitted on 20 Oct 2025 (v1), last revised 1 May 2026 (this version, v2)]

Title: Foundation Models for Discovery and Exploration in Chemical Space

Authors: Alexius Wadell, Anoushka Bhutani, Victor Azumah, Austin R. Ellis-Mohr, Andrew J. Stier, Kareem Hegazy, Alexander Brace, Hancheng Zhao, Celia Kelly, Anuj K. Nayak, Yuhan Chen, Dimitrios Simatos, Hongyi Lin, Murali Emani, Venkatram Vishwanath, Kevin Gering, Melisa Alkan, Tom Gibbs, Jack Wells, Wesley W. Qian, Richard C. Gerkin, Benjamin Amorelli, Alexander B. Wiltschko, Lav R. Varshney, Bharath Ramsundar, Karthik Duraisamy, Michael W. Mahoney, Arvind Ramanathan, Venkatasubramanian Viswanathan

Abstract: Accurate prediction of atomistic, thermodynamic, and kinetic properties from molecular structures underpins materials innovation. Existing computational and experimental approaches lack the scalability required to navigate chemical space efficiently. Scientific foundation models trained on large unlabelled datasets offer a path towards navigating chemical space across application domains. Here, we develop MIST, a family of molecular foundation models with up to an order of magnitude more parameters and data than prior works. Trained using a novel tokenizer, Smirk, which comprehensively captures nuclear, electronic, and geometric...

Originally published on May 04, 2026. Curated by AI News.
