[2602.12606] RelBench v2: A Large-Scale Benchmark and Repository for Relational Data

[2602.12606] RelBench v2: A Large-Scale Benchmark and Repository for Relational Data

arXiv - Machine Learning 4 min read Article

Summary

RelBench v2 introduces a comprehensive benchmark for relational deep learning, featuring 11 datasets and new predictive tasks, enhancing evaluation methods in relational data modeling.

Why It Matters

As relational deep learning evolves, having robust benchmarks like RelBench v2 is crucial for systematic evaluation and advancement in the field. This resource aids researchers in assessing model performance across diverse relational datasets, fostering innovation and improving applications in various sectors, from academia to enterprise.

Key Takeaways

  • RelBench v2 expands to 11 datasets with over 22 million rows, enhancing relational data evaluation.
  • Introduces autocomplete tasks for predicting missing values in relational tables, moving beyond traditional SQL forecasting.
  • Demonstrates that relational deep learning models outperform single-table baselines in various predictive tasks.

Computer Science > Machine Learning arXiv:2602.12606 (cs) [Submitted on 13 Feb 2026] Title:RelBench v2: A Large-Scale Benchmark and Repository for Relational Data Authors:Justin Gu, Rishabh Ranjan, Charilaos Kanatsoulis, Haiming Tang, Martin Jurkovic, Valter Hudovernik, Mark Znidar, Pranshu Chaturvedi, Parth Shroff, Fengyu Li, Jure Leskovec View a PDF of the paper titled RelBench v2: A Large-Scale Benchmark and Repository for Relational Data, by Justin Gu and 10 other authors View PDF HTML (experimental) Abstract:Relational deep learning (RDL) has emerged as a powerful paradigm for learning directly on relational databases by modeling entities and their relationships across multiple interconnected tables. As this paradigm evolves toward larger models and relational foundation models, scalable and realistic benchmarks are essential for enabling systematic evaluation and progress. In this paper, we introduce RelBench v2, a major expansion of the RelBench benchmark for RDL. RelBench v2 adds four large-scale relational datasets spanning scholarly publications, enterprise resource planning, consumer platforms, and clinical records, increasing the benchmark to 11 datasets comprising over 22 million rows across 29 tables. We further introduce autocomplete tasks, a new class of predictive objectives that require models to infer missing attribute values directly within relational tables while respecting temporal constraints, expanding beyond traditional forecasting tasks constructe...

Related Articles

Llms

OpenClaw security checklist: practical safeguards for AI agents

Here is one of the better quality guides on the ensuring safety when deploying OpenClaw: https://chatgptguide.ai/openclaw-security-checkl...

Reddit - Artificial Intelligence · 1 min ·
I let Gemini in Google Maps plan my day and it went surprisingly well | The Verge
Llms

I let Gemini in Google Maps plan my day and it went surprisingly well | The Verge

Gemini in Google Maps is a surprisingly useful way to explore new territory.

The Verge - AI · 11 min ·
Llms

The person who replaces you probably won't be AI. It'll be someone from the next department over who learned to use it - opinion/discussion

I'm a strategy person by background. Two years ago I'd write a recommendation and hand it to a product team. Now.. I describe what I want...

Reddit - Artificial Intelligence · 1 min ·
Block Resets Management With AI As Cash App Adds Installment Transfers
Llms

Block Resets Management With AI As Cash App Adds Installment Transfers

Block (NYSE:XYZ) plans a permanent organizational overhaul that replaces many middle management roles with AI-driven models to create fla...

AI Tools & Products · 5 min ·
More in Llms: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime