AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
[2603.10047] Toward Epistemic Stability: Engineering Consistent Procedures for Industrial LLM Hallucination Reduction
Llms

[2603.10047] Toward Epistemic Stability: Engineering Consistent Procedures for Industrial LLM Hallucination Reduction

Abstract page for arXiv paper 2603.10047: Toward Epistemic Stability: Engineering Consistent Procedures for Industrial LLM Hallucination ...

arXiv - AI · 4 min ·
[2512.18388] Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creation with Generative Models
Machine Learning

[2512.18388] Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creation with Generative Models

Abstract page for arXiv paper 2512.18388: Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creatio...

arXiv - AI · 4 min ·

All Content

[2602.19938] A Replicate-and-Quantize Strategy for Plug-and-Play Load Balancing of Sparse Mixture-of-Experts LLMs
Llms

[2602.19938] A Replicate-and-Quantize Strategy for Plug-and-Play Load Balancing of Sparse Mixture-of-Experts LLMs

The paper presents a Replicate-and-Quantize strategy for improving load balancing in Sparse Mixture-of-Experts (SMoE) models, enhancing i...

arXiv - Machine Learning · 4 min ·
[2602.18844] When Agda met Vampire
Ai Infrastructure

[2602.18844] When Agda met Vampire

The paper discusses integrating proof assistants like Agda with automated theorem provers (ATPs) to enhance automation in mechanized math...

arXiv - AI · 3 min ·
[2602.19926] Rethinking LoRA for Privacy-Preserving Federated Learning in Large Models
Llms

[2602.19926] Rethinking LoRA for Privacy-Preserving Federated Learning in Large Models

The paper presents LA-LoRA, a novel approach for fine-tuning large models in privacy-preserving federated learning, addressing key challe...

arXiv - AI · 4 min ·
[2602.18797] Carbon-aware decentralized dynamic task offloading in MIMO-MEC networks via multi-agent reinforcement learning
Ai Agents

[2602.18797] Carbon-aware decentralized dynamic task offloading in MIMO-MEC networks via multi-agent reinforcement learning

This paper presents CADDTO-PPO, a carbon-aware decentralized task offloading framework for MIMO-MEC networks using multi-agent reinforcem...

arXiv - Machine Learning · 4 min ·
[2602.18782] MANATEE: Inference-Time Lightweight Diffusion Based Safety Defense for LLMs
Llms

[2602.18782] MANATEE: Inference-Time Lightweight Diffusion Based Safety Defense for LLMs

The paper presents MANATEE, a novel defense mechanism for large language models (LLMs) against adversarial attacks, utilizing a lightweig...

arXiv - Machine Learning · 3 min ·
[2602.19845] I Dropped a Neural Net
Machine Learning

[2602.19845] I Dropped a Neural Net

The paper 'I Dropped a Neural Net' explores a unique challenge in machine learning, where a neural network's layers are shuffled and need...

arXiv - Machine Learning · 3 min ·
[2602.18758] UFO: Unlocking Ultra-Efficient Quantized Private Inference with Protocol and Algorithm Co-Optimization
Machine Learning

[2602.18758] UFO: Unlocking Ultra-Efficient Quantized Private Inference with Protocol and Algorithm Co-Optimization

The paper presents UFO, a quantized two-party computation framework that optimizes private CNN inference by combining efficient protocols...

arXiv - AI · 4 min ·
[2602.18745] Synthesizing Multimodal Geometry Datasets from Scratch and Enabling Visual Alignment via Plotting Code
Llms

[2602.18745] Synthesizing Multimodal Geometry Datasets from Scratch and Enabling Visual Alignment via Plotting Code

The paper presents a novel pipeline for synthesizing multimodal geometry datasets, introducing the GeoCode dataset which enhances visual-...

arXiv - AI · 3 min ·
[2602.18705] EDU-MATRIX: A Society-Centric Generative Cognitive Digital Twin Architecture for Secondary Education
Ai Agents

[2602.18705] EDU-MATRIX: A Society-Centric Generative Cognitive Digital Twin Architecture for Secondary Education

The EDU-MATRIX paper presents a novel generative cognitive digital twin architecture aimed at enhancing secondary education through a soc...

arXiv - AI · 3 min ·
[2602.18678] Heterogeneity-agnostic AI/ML-assisted beam selection for multi-panel arrays
Machine Learning

[2602.18678] Heterogeneity-agnostic AI/ML-assisted beam selection for multi-panel arrays

This paper presents a novel AI/ML-based beam selection algorithm that addresses the challenges posed by heterogeneous antenna configurati...

arXiv - Machine Learning · 3 min ·
[2602.19622] VecFormer: Towards Efficient and Generalizable Graph Transformer with Graph Token Attention
Machine Learning

[2602.19622] VecFormer: Towards Efficient and Generalizable Graph Transformer with Graph Token Attention

VecFormer introduces a novel Graph Transformer model that enhances efficiency and generalization in node classification, addressing compu...

arXiv - AI · 4 min ·
[2602.19610] Variational Inference for Bayesian MIDAS Regression
Machine Learning

[2602.19610] Variational Inference for Bayesian MIDAS Regression

This paper presents a Coordinate Ascent Variational Inference (CAVI) algorithm for Bayesian MIDAS regression, demonstrating significant s...

arXiv - Machine Learning · 4 min ·
[2602.19594] ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads?
Llms

[2602.19594] ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads?

ISO-Bench introduces a benchmark for coding agents to optimize real-world inference workloads, evaluating their performance against exper...

arXiv - Machine Learning · 3 min ·
[2602.19580] Leap+Verify: Regime-Adaptive Speculative Weight Prediction for Accelerating Neural Network Training
Llms

[2602.19580] Leap+Verify: Regime-Adaptive Speculative Weight Prediction for Accelerating Neural Network Training

The paper introduces Leap+Verify, a framework that enhances neural network training through speculative weight prediction, adapting to di...

arXiv - Machine Learning · 4 min ·
[2602.18568] RPU -- A Reasoning Processing Unit
Llms

[2602.18568] RPU -- A Reasoning Processing Unit

The paper introduces the Reasoning Processing Unit (RPU), a novel chiplet-based architecture designed to overcome memory bandwidth limita...

arXiv - AI · 3 min ·
[2602.18532] VLANeXt: Recipes for Building Strong VLA Models
Llms

[2602.18532] VLANeXt: Recipes for Building Strong VLA Models

The paper presents VLANeXt, a framework for building effective Vision-Language-Action (VLA) models, addressing inconsistencies in trainin...

arXiv - AI · 4 min ·
[2602.19498] Softmax is not Enough (for Adaptive Conformal Classification)
Nlp

[2602.19498] Softmax is not Enough (for Adaptive Conformal Classification)

The paper critiques the reliance on softmax outputs in adaptive conformal classification, proposing a new method that utilizes pre-softma...

arXiv - AI · 4 min ·
[2602.19489] Federated Learning Playground
Machine Learning

[2602.19489] Federated Learning Playground

The article presents the Federated Learning Playground, an interactive platform designed to teach core concepts of Federated Learning thr...

arXiv - AI · 3 min ·
[2602.18520] Sketch2Feedback: Grammar-in-the-Loop Framework for Rubric-Aligned Feedback on Student STEM Diagrams
Machine Learning

[2602.18520] Sketch2Feedback: Grammar-in-the-Loop Framework for Rubric-Aligned Feedback on Student STEM Diagrams

The paper presents Sketch2Feedback, a framework that enhances feedback on student-drawn STEM diagrams by integrating grammar rules to red...

arXiv - AI · 4 min ·
[2602.19414] Federated Causal Representation Learning in State-Space Systems for Decentralized Counterfactual Reasoning
Machine Learning

[2602.19414] Federated Causal Representation Learning in State-Space Systems for Decentralized Counterfactual Reasoning

This paper presents a federated framework for causal representation learning in state-space systems, enabling decentralized counterfactua...

arXiv - Machine Learning · 3 min ·
Previous Page 102 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime