[2604.02651] Communication-free Sampling and 4D Hybrid Parallelism for Scalable Mini-batch GNN Training
Computer Science > Machine Learning

arXiv:2604.02651 (cs)

[Submitted on 3 Apr 2026]

Title: Communication-free Sampling and 4D Hybrid Parallelism for Scalable Mini-batch GNN Training

Authors: Cunyang Wei, Siddharth Singh, Aishwarya Sarkar, Daniel Nichols, Tisha Patel, Aditya K. Ranjan, Sayan Ghosh, Ali Jannesari, Nathan R. Tallent, Abhinav Bhatele

Abstract: Graph neural networks (GNNs) are widely used for learning on graph datasets derived from various real-world scenarios. Learning from extremely large graphs requires distributed training, and mini-batching with sampling is a popular approach for parallelizing GNN training. Existing distributed mini-batch approaches have significant performance bottlenecks due to expensive sampling methods and limited scaling when using data parallelism. In this work, we present ScaleGNN, a 4D parallel framework for scalable mini-batch GNN training that combines communication-free distributed sampling, 3D parallel matrix multiplication (PMM), and data parallelism. ScaleGNN introduces a uniform vertex sampling algorithm, enabling each process (GPU device) to construct its local mini-batch, i.e., subgraph partition, without any inter-process communication. 3D PMM enables scaling mini-batch training to much larger GPU counts than vanilla data parallelism with si...
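The abstract does not spell out how the communication-free sampling works; a common way to achieve it, and a minimal sketch of the idea, is to give every rank the same RNG seed so that each process independently reproduces the identical global mini-batch and then keeps only its own shard. The function name and shard layout below are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np

def local_minibatch(rank, world_size, num_vertices, batch_size, seed):
    """Illustrative sketch: every rank draws the SAME global mini-batch
    from a shared seed, so no inter-process communication is needed."""
    rng = np.random.default_rng(seed)  # identical stream on all ranks
    global_batch = rng.choice(num_vertices, size=batch_size, replace=False)
    # Each rank keeps only its contiguous shard of the shared batch.
    return np.array_split(global_batch, world_size)[rank]

# Simulate 4 ranks; their shards partition one 64-vertex mini-batch.
shards = [local_minibatch(r, 4, 1000, 64, seed=42) for r in range(4)]
assert sum(len(s) for s in shards) == 64
```

Because the global batch is deterministic given the seed, the shards are disjoint and together cover the whole mini-batch, which is the property that lets each GPU build its subgraph partition locally.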