AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

AI Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 7 hours ago

LLMs

What is the current landscape on AI agents knowledge

Recently used "free" rates codex to give me a quick FastAPI project sample. It gave me deprecated @app.on_event("startup"). What are you...

Reddit - Artificial Intelligence · 1 min · about 18 hours ago

Open Source AI

NVIDIA Isaac GR00T N1.7: Open Reasoning VLA Model for Humanoid Robots

A Blog post by NVIDIA on Hugging Face

Hugging Face Blog · 4 min · about 23 hours ago

All Content

Machine Learning

[2505.11304] Heterogeneity-Aware Client Sampling for Optimal and Efficient Federated Learning

This paper presents a novel approach to federated learning by addressing the challenges posed by heterogeneous client capabilities. The p...

arXiv - AI · 4 min · 2 months ago

LLMs

[2508.03346] Making Slow Thinking Faster: Compressing LLM Chain-of-Thought via Step Entropy

This article presents a novel framework for compressing Chain-of-Thought (CoT) prompts in Large Language Models (LLMs) to enhance inferen...

arXiv - AI · 4 min · 2 months ago

Machine Learning

[2505.08145] A Generalized Hierarchical Federated Learning Framework with Theoretical Guarantees

This article presents a novel Multi-Layer Hierarchical Federated Learning framework (QMLHFL) that enhances scalability and flexibility in...

arXiv - Machine Learning · 4 min · 2 months ago

Machine Learning

[2505.06795] Sparse Latent Factor Forecaster (SLFF) with Iterative Inference for Transparent Multi-Horizon Commodity Futures Prediction

The Sparse Latent Factor Forecaster (SLFF) proposes a new approach for predicting commodity futures by addressing forecast errors and enh...

arXiv - AI · 4 min · 2 months ago

Machine Learning

[2411.06403] Mastering NIM and Impartial Games with Weak Neural Networks: An AlphaZero-inspired Multi-Frame Approach

This paper explores the application of weak neural networks in mastering impartial games like NIM, utilizing an AlphaZero-inspired multi-...

arXiv - AI · 4 min · 2 months ago

LLMs

[2503.08796] Robust Multi-Objective Controlled Decoding of Large Language Models

This article presents Robust Multi-Objective Decoding (RMOD), an innovative algorithm designed to enhance the performance of Large Langua...

arXiv - AI · 3 min · 2 months ago

LLMs

[2502.05376] LO-BCQ: Block Clustered Quantization for 4-bit (W4A4) LLM Inference

The paper presents LO-BCQ, a novel block clustered quantization method for 4-bit LLM inference, achieving less than 1% accuracy loss whil...

arXiv - Machine Learning · 4 min · 2 months ago

Machine Learning

[2602.14917] BFS-PO: Best-First Search for Large Reasoning Models

The paper proposes BFS-PO, a new reinforcement learning algorithm that enhances the performance of Large Reasoning Models by reducing com...

arXiv - AI · 3 min · 2 months ago

LLMs

[2501.16178] SWIFT: Mapping Sub-series with Wavelet Decomposition Improves Time Series Forecasting

The paper presents SWIFT, a lightweight model that enhances time series forecasting using wavelet decomposition, achieving state-of-the-a...

arXiv - Machine Learning · 4 min · 2 months ago

Machine Learning

[2501.15889] Adaptive Width Neural Networks

The paper introduces Adaptive Width Neural Networks, a novel approach that optimizes the width of neural network layers during training, ...

arXiv - AI · 4 min · 2 months ago

Machine Learning

[2501.05633] Regularized Top-$k$: A Bayesian Framework for Gradient Sparsification

The paper presents a Bayesian framework for gradient sparsification called Regularized Top-k (RegTop-k), which improves convergence in di...

arXiv - Machine Learning · 4 min · 2 months ago

Machine Learning

[2411.08982] Lynx: Enabling Efficient MoE Inference through Dynamic Batch-Aware Expert Selection

The paper introduces Lynx, a system designed to enhance the efficiency of Mixture-of-Expert (MoE) models by implementing dynamic batch-aw...

arXiv - Machine Learning · 4 min · 2 months ago

Machine Learning

[2411.16085] Cautious Optimizers: Improving Training with One Line of Code

This article presents a new approach to optimizing training in machine learning by introducing a simple one-line modification to existing...

arXiv - AI · 3 min · 2 months ago

LLMs

[2410.10481] Model-based Large Language Model Customization as Service

The paper presents Llamdex, a framework for customizing large language models (LLMs) as a service, allowing clients to upload domain-spec...

arXiv - AI · 4 min · 2 months ago

LLMs

[2406.12844] Synergizing Foundation Models and Federated Learning: A Survey

This survey explores the integration of Foundation Models (FMs) and Federated Learning (FL), termed Federated Foundation Models (FedFM), ...

arXiv - AI · 4 min · 2 months ago

LLMs

[2402.15751] Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning

The paper introduces Sparse MeZO, a novel optimization technique for fine-tuning large language models (LLMs) that reduces memory usage w...

arXiv - AI · 4 min · 2 months ago

LLMs

[2602.14760] Residual Connections and the Causal Shift: Uncovering a Structural Misalignment in Transformers

This article explores a structural misalignment in Transformers, particularly regarding residual connections and their impact on next-tok...

arXiv - AI · 3 min · 2 months ago

Machine Learning

[2402.02644] Permutation-based Inference for Variational Learning of Directed Acyclic Graphs

This paper presents PIVID, a novel method for inferring distributions over permutations and directed acyclic graphs (DAGs) using variatio...

arXiv - Machine Learning · 3 min · 2 months ago

LLMs

[2312.02355] When is Offline Policy Selection Sample Efficient for Reinforcement Learning?

This paper explores the efficiency of offline policy selection (OPS) in reinforcement learning, connecting it to off-policy evaluation (O...

arXiv - AI · 4 min · 2 months ago

Machine Learning

[2112.06251] Learning with Subset Stacking

The paper introduces a novel regression algorithm called Learning with Subset Stacking (LESS), which effectively learns from heterogeneou...

arXiv - Machine Learning · 3 min · 2 months ago
