AI Infrastructure

GPUs, training clusters, MLOps, and deployment


Top This Week

AI Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · 42 minutes ago

LLMs

[2603.10047] Toward Epistemic Stability: Engineering Consistent Procedures for Industrial LLM Hallucination Reduction

Abstract page for arXiv paper 2603.10047: Toward Epistemic Stability: Engineering Consistent Procedures for Industrial LLM Hallucination ...

arXiv - AI · 4 min · about 2 hours ago

Machine Learning

[2512.18388] Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creation with Generative Models

Abstract page for arXiv paper 2512.18388: Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creatio...

arXiv - AI · 4 min · about 2 hours ago

All Content

LLMs

[2602.19938] A Replicate-and-Quantize Strategy for Plug-and-Play Load Balancing of Sparse Mixture-of-Experts LLMs

The paper presents a Replicate-and-Quantize strategy for improving load balancing in Sparse Mixture-of-Experts (SMoE) models, enhancing i...

arXiv - Machine Learning · 4 min · about 1 month ago

AI Infrastructure

[2602.18844] When Agda met Vampire

The paper discusses integrating proof assistants like Agda with automated theorem provers (ATPs) to enhance automation in mechanized math...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.19926] Rethinking LoRA for Privacy-Preserving Federated Learning in Large Models

The paper presents LA-LoRA, a novel approach for fine-tuning large models in privacy-preserving federated learning, addressing key challe...

arXiv - AI · 4 min · about 1 month ago

AI Agents

[2602.18797] Carbon-aware decentralized dynamic task offloading in MIMO-MEC networks via multi-agent reinforcement learning

This paper presents CADDTO-PPO, a carbon-aware decentralized task offloading framework for MIMO-MEC networks using multi-agent reinforcem...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2602.18782] MANATEE: Inference-Time Lightweight Diffusion Based Safety Defense for LLMs

The paper presents MANATEE, a novel defense mechanism for large language models (LLMs) against adversarial attacks, utilizing a lightweig...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.19845] I Dropped a Neural Net

The paper 'I Dropped a Neural Net' explores a unique challenge in machine learning, where a neural network's layers are shuffled and need...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.18758] UFO: Unlocking Ultra-Efficient Quantized Private Inference with Protocol and Algorithm Co-Optimization

The paper presents UFO, a quantized two-party computation framework that optimizes private CNN inference by combining efficient protocols...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.18745] Synthesizing Multimodal Geometry Datasets from Scratch and Enabling Visual Alignment via Plotting Code

The paper presents a novel pipeline for synthesizing multimodal geometry datasets, introducing the GeoCode dataset which enhances visual-...

arXiv - AI · 3 min · about 1 month ago

AI Agents

[2602.18705] EDU-MATRIX: A Society-Centric Generative Cognitive Digital Twin Architecture for Secondary Education

The EDU-MATRIX paper presents a novel generative cognitive digital twin architecture aimed at enhancing secondary education through a soc...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.18678] Heterogeneity-agnostic AI/ML-assisted beam selection for multi-panel arrays

This paper presents a novel AI/ML-based beam selection algorithm that addresses the challenges posed by heterogeneous antenna configurati...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.19622] VecFormer: Towards Efficient and Generalizable Graph Transformer with Graph Token Attention

VecFormer introduces a novel Graph Transformer model that enhances efficiency and generalization in node classification, addressing compu...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.19610] Variational Inference for Bayesian MIDAS Regression

This paper presents a Coordinate Ascent Variational Inference (CAVI) algorithm for Bayesian MIDAS regression, demonstrating significant s...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2602.19594] ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads?

ISO-Bench introduces a benchmark for coding agents to optimize real-world inference workloads, evaluating their performance against exper...

arXiv - Machine Learning · 3 min · about 1 month ago

LLMs

[2602.19580] Leap+Verify: Regime-Adaptive Speculative Weight Prediction for Accelerating Neural Network Training

The paper introduces Leap+Verify, a framework that enhances neural network training through speculative weight prediction, adapting to di...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2602.18568] RPU -- A Reasoning Processing Unit

The paper introduces the Reasoning Processing Unit (RPU), a novel chiplet-based architecture designed to overcome memory bandwidth limita...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.18532] VLANeXt: Recipes for Building Strong VLA Models

The paper presents VLANeXt, a framework for building effective Vision-Language-Action (VLA) models, addressing inconsistencies in trainin...

arXiv - AI · 4 min · about 1 month ago

NLP

[2602.19498] Softmax is not Enough (for Adaptive Conformal Classification)

The paper critiques the reliance on softmax outputs in adaptive conformal classification, proposing a new method that utilizes pre-softma...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.19489] Federated Learning Playground

The article presents the Federated Learning Playground, an interactive platform designed to teach core concepts of Federated Learning thr...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.18520] Sketch2Feedback: Grammar-in-the-Loop Framework for Rubric-Aligned Feedback on Student STEM Diagrams

The paper presents Sketch2Feedback, a framework that enhances feedback on student-drawn STEM diagrams by integrating grammar rules to red...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.19414] Federated Causal Representation Learning in State-Space Systems for Decentralized Counterfactual Reasoning

This paper presents a federated framework for causal representation learning in state-space systems, enabling decentralized counterfactua...

arXiv - Machine Learning · 3 min · about 1 month ago

