AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 1 hour ago

Llms

[R] Looking for arXiv cs.LG endorser, inference monitoring using information geometry

Hi r/MachineLearning, I’m looking for an arXiv endorser in cs.LG for a paper on inference-time distribution shift detection for deployed ...

Reddit - Machine Learning · 1 min · about 6 hours ago

Llms

Nvidia goes all-in on AI agents while Anthropic pulls the plug

TLDR: Nvidia is partnering with 17 major companies to build a platform specifically for enterprise AI agents, basically trying to become ...

Reddit - Artificial Intelligence · 1 min · about 16 hours ago

All Content

Llms

[2507.04517] DOTResize: Reducing LLM Width via Discrete Optimal Transport-based Neuron Merging

The paper presents DOTResize, a novel method for reducing the width of Large Language Models (LLMs) through Discrete Optimal Transport-ba...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2507.12652] Federated Learning in Offline and Online EMG Decoding: A Privacy and Performance Perspective

This article explores the application of federated learning (FL) in offline and online EMG decoding, addressing privacy and performance c...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2506.21427] Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning

The paper presents the Single-Step Completion Policy (SSCP), a novel approach in reinforcement learning that enhances efficiency and expr...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2504.07835] Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks

The paper presents Pychop, a Python library that emulates low-precision arithmetic for numerical methods and neural networks, enhancing c...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2505.23725] MuLoCo: Muon is a practical inner optimizer for DiLoCo

The paper presents MuLoCo, a new inner optimizer for the DiLoCo framework, demonstrating its superior performance in training large langu...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2502.00944] Training speedups via batching for geometric learning: an analysis of static and dynamic algorithms

This article analyzes the impact of static and dynamic batching algorithms on training speed and performance in graph neural networks (GN...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.22056] FlowCorrect: Efficient Interactive Correction of Generative Flow Policies for Robotic Manipulation

The paper presents FlowCorrect, a framework for correcting generative flow policies in robotic manipulation using minimal human input, im...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.21873] GFPL: Generative Federated Prototype Learning for Resource-Constrained and Data-Imbalanced Vision Task

The GFPL framework enhances federated learning by addressing data imbalance and communication overhead in resource-constrained vision tas...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.21846] Scalable Kernel-Based Distances for Statistical Inference and Integration

This paper explores scalable kernel-based distances for statistical inference, focusing on the maximum mean discrepancy (MMD) and introdu...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.21788] DHP: Efficient Scaling of MLLM Training with Dynamic Hybrid Parallelism

The paper presents Dynamic Hybrid Parallelism (DHP), a new strategy for efficiently scaling the training of Multimodal Large Language Mod...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.21756] Offline Reasoning for Efficient Recommendation: LLM-Empowered Persona-Profiled Item Indexing

The paper presents Persona4Rec, a novel recommendation framework that utilizes offline reasoning with large language models (LLMs) to cre...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.21721] Private and Robust Contribution Evaluation in Federated Learning

This paper presents novel methods for evaluating contributions in federated learning while ensuring privacy and robustness, addressing vu...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.21478] Efficient Inference after Directionally Stable Adaptive Experiments

This paper explores efficient inference methods for adaptive experiments, introducing the concept of directional stability, which enhance...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.21428] PSF-Med: Measuring and Explaining Paraphrase Sensitivity in Medical Vision Language Models

The paper introduces PSF-Med, a benchmark assessing paraphrase sensitivity in medical vision language models, revealing significant varia...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.21357] Conditional neural control variates for variance reduction in Bayesian inverse problems

This paper presents a novel approach using conditional neural control variates to reduce variance in Bayesian inverse problems, enhancing...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.21265] ToolMATH: A Math Tool Benchmark for Realistic Long-Horizon Multi-Tool Reasoning

ToolMATH introduces a benchmark for evaluating tool-augmented language models in realistic multi-tool environments, focusing on long-hori...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.22136] SigmaQuant: Hardware-Aware Heterogeneous Quantization Method for Edge DNN Inference

The paper introduces SigmaQuant, a hardware-aware heterogeneous quantization method for deep neural networks (DNNs) aimed at optimizing p...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.22015] Function-Space Empirical Bayes Regularisation with Student's t Priors

This paper presents a novel function-space empirical Bayes regularisation framework using heavy-tailed Student's t priors to improve Baye...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.21965] Compact Circulant Layers with Spectral Priors

This paper explores compact circulant layers with spectral priors, focusing on their application in memory-efficient neural networks for ...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.21824] DocDjinn: Controllable Synthetic Document Generation with VLMs and Handwriting Diffusion

DocDjinn introduces a framework for generating synthetic documents using Vision-Language Models (VLMs), addressing challenges in data acq...

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 76 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence

[R] Looking for arXiv cs.LG endorser, inference monitoring using information geometry

Nvidia goes all-in on AI agents while Anthropic pulls the plug

All Content

[2507.04517] DOTResize: Reducing LLM Width via Discrete Optimal Transport-based Neuron Merging

[2507.12652] Federated Learning in Offline and Online EMG Decoding: A Privacy and Performance Perspective

[2506.21427] Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning

[2504.07835] Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks

[2505.23725] MuLoCo: Muon is a practical inner optimizer for DiLoCo

[2502.00944] Training speedups via batching for geometric learning: an analysis of static and dynamic algorithms

[2602.22056] FlowCorrect: Efficient Interactive Correction of Generative Flow Policies for Robotic Manipulation

[2602.21873] GFPL: Generative Federated Prototype Learning for Resource-Constrained and Data-Imbalanced Vision Task

[2602.21846] Scalable Kernel-Based Distances for Statistical Inference and Integration

[2602.21788] DHP: Efficient Scaling of MLLM Training with Dynamic Hybrid Parallelism

[2602.21756] Offline Reasoning for Efficient Recommendation: LLM-Empowered Persona-Profiled Item Indexing

[2602.21721] Private and Robust Contribution Evaluation in Federated Learning

[2602.21478] Efficient Inference after Directionally Stable Adaptive Experiments

[2602.21428] PSF-Med: Measuring and Explaining Paraphrase Sensitivity in Medical Vision Language Models

[2602.21357] Conditional neural control variates for variance reduction in Bayesian inverse problems

[2602.21265] ToolMATH: A Math Tool Benchmark for Realistic Long-Horizon Multi-Tool Reasoning

[2602.22136] SigmaQuant: Hardware-Aware Heterogeneous Quantization Method for Edge DNN Inference

[2602.22015] Function-Space Empirical Bayes Regularisation with Student's t Priors

[2602.21965] Compact Circulant Layers with Spectral Priors

[2602.21824] DocDjinn: Controllable Synthetic Document Generation with VLMs and Handwriting Diffusion

Related Topics

Stay updated with AI News