[2406.01825] Reliable OOD Virtual Screening with Extrapolatory Pseudo-Label Matching

[2406.01825] Reliable OOD Virtual Screening with Extrapolatory Pseudo-Label Matching

arXiv - AI 4 min read

About this article

Abstract page for arXiv paper 2406.01825: Reliable OOD Virtual Screening with Extrapolatory Pseudo-Label Matching

Computer Science > Machine Learning arXiv:2406.01825 (cs) [Submitted on 3 Jun 2024 (v1), last revised 24 Mar 2026 (this version, v5)] Title:Reliable OOD Virtual Screening with Extrapolatory Pseudo-Label Matching Authors:Yunni Qu (1), Bhargav Vaduri (1), Karthikeya Jatoth (1), James Wellnitz (2), Dzung Dinh (1), Seth Veenbaas (2), Jonathan Chapman (2), Alexander Tropsha (2), Junier Oliva (1) ((1) Department of Computer Science, University of North Carolina at Chapel Hill, (2) Eshelman School of Pharmacy, University of North Carolina at Chapel Hill) View a PDF of the paper titled Reliable OOD Virtual Screening with Extrapolatory Pseudo-Label Matching, by Yunni Qu (1) and 11 other authors View PDF HTML (experimental) Abstract:Machine learning (ML) models are increasingly deployed for virtual screening in drug discovery, where the goal is to identify novel, chemically diverse scaffolds while minimizing experimental costs. This creates a fundamental challenge: the most valuable discoveries lie in out-of-distribution (OOD) regions beyond the training data, yet ML models often degrade under distribution shift. Standard novelty-rejection strategies ensure reliability within the training domain but limit discovery by rejecting precisely the novel scaffolds most worth finding. Moreover, experimental budgets permit testing only a small fraction of nominated candidates, demanding models that produce reliable confidence estimates. We introduce EXPLOR (Extrapolatory Pseudo-Label Matchin...

Originally published on March 25, 2026. Curated by AI News.

Related Articles

Machine Learning

[R] ICML Anonymized git repos for rebuttal

A number of the papers I'm reviewing for have submitted additional figures and code through anonymized git repos (e.g. https://anonymous....

Reddit - Machine Learning · 1 min ·
Llms

[R] Reference model free behavioral discovery of AudiBench model organisms via Probe-Mediated Adaptive Auditing

Anthropic's AuditBench - 56 Llama 3.3 70B models with planted hidden behaviors - their best agent detects the behaviros 10-13% of the tim...

Reddit - Machine Learning · 1 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Llms

[P] Dante-2B: I'm training a 2.1B bilingual fully open Italian/English LLM from scratch on 2×H200. Phase 1 done — here's what I've built.

The problem If you work with Italian text and local models, you know the pain. Every open-source LLM out there treats Italian as an after...

Reddit - Machine Learning · 1 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime