AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

Llms

The open-source AI system that beat Claude Sonnet on a $500 GPU just shipped a coding assistant

A week or two ago, an open-source project called ATLAS made the rounds for scoring 74.6% on LiveCodeBench with a frozen 9B model on a sin...

Reddit - Artificial Intelligence · 1 min ·
Ai Infrastructure

I built an AI content engine that turns one piece of content into posts for 9 platforms — fully automated with n8n

What it does: You give it any input — a blog URL, a YouTube video, raw text, or just a topic — and it generates optimized posts for 9 pla...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

mining hardware doing AI training - is the output actually useful

there's this network that launched recently routing crypto mining hardware toward AI training workloads. miners seem happy with the econo...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.19622] VecFormer: Towards Efficient and Generalizable Graph Transformer with Graph Token Attention
Machine Learning

[2602.19622] VecFormer: Towards Efficient and Generalizable Graph Transformer with Graph Token Attention

VecFormer introduces a novel Graph Transformer model that enhances efficiency and generalization in node classification, addressing compu...

arXiv - AI · 4 min ·
[2602.19610] Variational Inference for Bayesian MIDAS Regression
Machine Learning

[2602.19610] Variational Inference for Bayesian MIDAS Regression

This paper presents a Coordinate Ascent Variational Inference (CAVI) algorithm for Bayesian MIDAS regression, demonstrating significant s...

arXiv - Machine Learning · 4 min ·
[2602.19594] ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads?
Llms

[2602.19594] ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads?

ISO-Bench introduces a benchmark for coding agents to optimize real-world inference workloads, evaluating their performance against exper...

arXiv - Machine Learning · 3 min ·
[2602.19580] Leap+Verify: Regime-Adaptive Speculative Weight Prediction for Accelerating Neural Network Training
Llms

[2602.19580] Leap+Verify: Regime-Adaptive Speculative Weight Prediction for Accelerating Neural Network Training

The paper introduces Leap+Verify, a framework that enhances neural network training through speculative weight prediction, adapting to di...

arXiv - Machine Learning · 4 min ·
[2602.18568] RPU -- A Reasoning Processing Unit
Llms

[2602.18568] RPU -- A Reasoning Processing Unit

The paper introduces the Reasoning Processing Unit (RPU), a novel chiplet-based architecture designed to overcome memory bandwidth limita...

arXiv - AI · 3 min ·
[2602.18532] VLANeXt: Recipes for Building Strong VLA Models
Llms

[2602.18532] VLANeXt: Recipes for Building Strong VLA Models

The paper presents VLANeXt, a framework for building effective Vision-Language-Action (VLA) models, addressing inconsistencies in trainin...

arXiv - AI · 4 min ·
[2602.19498] Softmax is not Enough (for Adaptive Conformal Classification)
Nlp

[2602.19498] Softmax is not Enough (for Adaptive Conformal Classification)

The paper critiques the reliance on softmax outputs in adaptive conformal classification, proposing a new method that utilizes pre-softma...

arXiv - AI · 4 min ·
[2602.19489] Federated Learning Playground
Machine Learning

[2602.19489] Federated Learning Playground

The article presents the Federated Learning Playground, an interactive platform designed to teach core concepts of Federated Learning thr...

arXiv - AI · 3 min ·
[2602.18520] Sketch2Feedback: Grammar-in-the-Loop Framework for Rubric-Aligned Feedback on Student STEM Diagrams
Machine Learning

[2602.18520] Sketch2Feedback: Grammar-in-the-Loop Framework for Rubric-Aligned Feedback on Student STEM Diagrams

The paper presents Sketch2Feedback, a framework that enhances feedback on student-drawn STEM diagrams by integrating grammar rules to red...

arXiv - AI · 4 min ·
[2602.19414] Federated Causal Representation Learning in State-Space Systems for Decentralized Counterfactual Reasoning
Machine Learning

[2602.19414] Federated Causal Representation Learning in State-Space Systems for Decentralized Counterfactual Reasoning

This paper presents a federated framework for causal representation learning in state-space systems, enabling decentralized counterfactua...

arXiv - Machine Learning · 3 min ·
[2602.18511] Beyond Pass-by-Pass Optimization: Intent-Driven IR Optimization with Large Language Models
Llms

[2602.18511] Beyond Pass-by-Pass Optimization: Intent-Driven IR Optimization with Large Language Models

The paper presents IntOpt, an intent-driven IR optimizer that enhances program optimization by separating high-level intent from low-leve...

arXiv - AI · 4 min ·
[2602.19392] Spiking Graph Predictive Coding for Reliable OOD Generalization
Machine Learning

[2602.19392] Spiking Graph Predictive Coding for Reliable OOD Generalization

The paper introduces Spiking Graph Predictive Coding (SIGHT), a novel approach to enhance out-of-distribution (OOD) generalization in gra...

arXiv - Machine Learning · 3 min ·
[2602.19362] LLMs Can Learn to Reason Via Off-Policy RL
Llms

[2602.19362] LLMs Can Learn to Reason Via Off-Policy RL

The paper presents a novel off-policy reinforcement learning algorithm, OAPL, for Large Language Models (LLMs) that enhances reasoning ca...

arXiv - Machine Learning · 4 min ·
[2602.19332] Training-Free Cross-Architecture Merging for Graph Neural Networks
Machine Learning

[2602.19332] Training-Free Cross-Architecture Merging for Graph Neural Networks

The paper presents H-GRAMA, a training-free framework for merging heterogeneous Graph Neural Networks (GNNs), allowing efficient model in...

arXiv - Machine Learning · 3 min ·
[2602.18478] ZUNA: Flexible EEG Superresolution with Position-Aware Diffusion Autoencoders
Machine Learning

[2602.18478] ZUNA: Flexible EEG Superresolution with Position-Aware Diffusion Autoencoders

The paper presents ZUNA, a 380M-parameter masked diffusion autoencoder designed for EEG signal superresolution and channel infilling, dem...

arXiv - Machine Learning · 3 min ·
[2602.19330] CTS-Bench: Benchmarking Graph Coarsening Trade-offs for GNNs in Clock Tree Synthesis
Machine Learning

[2602.19330] CTS-Bench: Benchmarking Graph Coarsening Trade-offs for GNNs in Clock Tree Synthesis

The paper introduces CTS-Bench, a benchmark suite for evaluating graph coarsening trade-offs in Graph Neural Networks (GNNs) for Clock Tr...

arXiv - Machine Learning · 4 min ·
[2602.19271] Taming Preconditioner Drift: Unlocking the Potential of Second-Order Optimizers for Federated Learning on Non-IID Data
Machine Learning

[2602.19271] Taming Preconditioner Drift: Unlocking the Potential of Second-Order Optimizers for Federated Learning on Non-IID Data

This paper presents FedPAC, a framework to enhance the stability and accuracy of second-order optimizers in federated learning on non-IID...

arXiv - AI · 4 min ·
[2602.18471] Charting the Future of AI-supported Science Education: A Human-Centered Vision
Ai Infrastructure

[2602.18471] Charting the Future of AI-supported Science Education: A Human-Centered Vision

This article discusses the transformative potential of AI in science education, proposing a human-centered framework for its ethical inte...

arXiv - AI · 4 min ·
[2602.18470] Transforming Science Learning Materials in the Era of Artificial Intelligence
Generative Ai

[2602.18470] Transforming Science Learning Materials in the Era of Artificial Intelligence

This article explores how AI is reshaping science learning materials, enhancing personalization, accessibility, and interactivity while a...

arXiv - AI · 4 min ·
[2602.18469] The Landscape of AI in Science Education: What is Changing and How to Respond
Ai Infrastructure

[2602.18469] The Landscape of AI in Science Education: What is Changing and How to Respond

This article explores the transformative impact of AI on science education, highlighting changes in educational practices and the need fo...

arXiv - AI · 4 min ·
Previous Page 96 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime