[2406.01825] Reliable OOD Virtual Screening with Extrapolatory

[2406.01825] Reliable OOD Virtual Screening with Extrapolatory Pseudo-Label Matching

arXiv - AI March 25, 2026 4 min read

About this article

Abstract page for arXiv paper 2406.01825: Reliable OOD Virtual Screening with Extrapolatory Pseudo-Label Matching

Computer Science > Machine Learning arXiv:2406.01825 (cs) [Submitted on 3 Jun 2024 (v1), last revised 24 Mar 2026 (this version, v5)] Title:Reliable OOD Virtual Screening with Extrapolatory Pseudo-Label Matching Authors:Yunni Qu (1), Bhargav Vaduri (1), Karthikeya Jatoth (1), James Wellnitz (2), Dzung Dinh (1), Seth Veenbaas (2), Jonathan Chapman (2), Alexander Tropsha (2), Junier Oliva (1) ((1) Department of Computer Science, University of North Carolina at Chapel Hill, (2) Eshelman School of Pharmacy, University of North Carolina at Chapel Hill) View a PDF of the paper titled Reliable OOD Virtual Screening with Extrapolatory Pseudo-Label Matching, by Yunni Qu (1) and 11 other authors View PDF HTML (experimental) Abstract:Machine learning (ML) models are increasingly deployed for virtual screening in drug discovery, where the goal is to identify novel, chemically diverse scaffolds while minimizing experimental costs. This creates a fundamental challenge: the most valuable discoveries lie in out-of-distribution (OOD) regions beyond the training data, yet ML models often degrade under distribution shift. Standard novelty-rejection strategies ensure reliability within the training domain but limit discovery by rejecting precisely the novel scaffolds most worth finding. Moreover, experimental budgets permit testing only a small fraction of nominated candidates, demanding models that produce reliable confidence estimates. We introduce EXPLOR (Extrapolatory Pseudo-Label Matchin...

Originally published on March 25, 2026. Curated by AI News.

Machine Learning

[R] ICML Anonymized git repos for rebuttal

A number of the papers I'm reviewing for have submitted additional figures and code through anonymized git repos (e.g. https://anonymous....

Reddit - Machine Learning · 1 min · about 1 hour ago

Llms

[R] Reference model free behavioral discovery of AudiBench model organisms via Probe-Mediated Adaptive Auditing

Anthropic's AuditBench - 56 Llama 3.3 70B models with planted hidden behaviors - their best agent detects the behaviros 10-13% of the tim...

Reddit - Machine Learning · 1 min · about 1 hour ago

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 1 hour ago

Llms

[P] Dante-2B: I'm training a 2.1B bilingual fully open Italian/English LLM from scratch on 2×H200. Phase 1 done — here's what I've built.

The problem If you work with Italian text and local models, you know the pain. Every open-source LLM out there treats Italian as an after...

Reddit - Machine Learning · 1 min · about 2 hours ago

[2406.01825] Reliable OOD Virtual Screening with Extrapolatory Pseudo-Label Matching

About this article

Related Articles

[R] ICML Anonymized git repos for rebuttal

[R] Reference model free behavioral discovery of AudiBench model organisms via Probe-Mediated Adaptive Auditing

UMKC Announces New Master of Science in Artificial Intelligence

[P] Dante-2B: I'm training a 2.1B bilingual fully open Italian/English LLM from scratch on 2×H200. Phase 1 done — here's what I've built.

No comments

Stay updated with AI News