AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

Integrate Physical AI Capabilities into Existing Apps with NVIDIA Omniverse Libraries

Physical AI—AI systems that perceive, reason, and act in physically grounded simulated environments—is changing how teams design and vali...

AI Tools & Products · 14 min · 31 minutes ago

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 1 hour ago

Ai Infrastructure

Dell and HIVE partner to deploy Nvidia’s next-generation AI chips

AI News - General · 1 min · about 1 hour ago

All Content

Ai Infrastructure

[2510.18478] Safe But Not Sorry: Reducing Over-Conservatism in Safety Critics via Uncertainty-Aware Modulation

This article presents the Uncertain Safety Critic (USC), a novel approach to enhance safety in reinforcement learning (RL) by balancing s...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2509.15194] Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

The paper presents EVOL-RL, a novel framework for evolving language models without labels, balancing majority-driven stability with novel...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2508.12907] SNAP-UQ: Self-supervised Next-Activation Prediction for Single-Pass Uncertainty in TinyML

The paper presents SNAP-UQ, a novel method for single-pass uncertainty estimation in TinyML, enhancing reliability in on-device monitorin...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2501.16534] Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs

This article presents a novel technique for extracting safety classifiers from aligned large language models (LLMs) to address vulnerabil...

arXiv - AI · 4 min · about 2 months ago

Llms

[2501.14406] Adaptive Rank Allocation for Federated Parameter-Efficient Fine-Tuning of Language Models

The paper presents FedARA, an innovative framework for federated parameter-efficient fine-tuning of language models, addressing data hete...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2508.10480] Pinet: Optimizing hard-constrained neural networks with orthogonal projection layers

The paper introduces $ ext{Pinet}$, a novel output layer for neural networks that optimizes hard constraints using orthogonal projection ...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2507.12257] Robust Causal Discovery in Real-World Time Series with Power-Laws

This paper presents a novel method for causal discovery in time series data, leveraging power-law distributions to enhance robustness aga...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2506.14202] DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation

The paper introduces DiffusionBlocks, a framework for block-wise training of neural networks that reduces memory bottlenecks while mainta...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.11348] AgentNoiseBench: Benchmarking Robustness of Tool-Using LLM Agents Under Noisy Condition

The paper introduces AgentNoiseBench, a framework for evaluating the robustness of tool-using LLM agents under noisy conditions, highligh...

arXiv - AI · 4 min · about 2 months ago

Llms

[2505.19427] WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference

The paper introduces WINA, a novel framework for efficient inference in large language models (LLMs) that optimally combines hidden state...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.02050] Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents

This article explores the role of entropy in optimizing tool-use behaviors for Large Language Model (LLM) agents, highlighting the correl...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2504.06768] FedMerge: Federated Personalization via Model Merging

The paper introduces FedMerge, a novel approach in federated learning that enables personalized model creation for clients by merging mul...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2504.05615] FedEFC: Federated Learning Using Enhanced Forward Correction Against Noisy Labels

The paper presents FedEFC, a novel approach to federated learning that addresses the challenges posed by noisy labels through techniques ...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2601.01569] CaveAgent: Transforming LLMs into Stateful Runtime Operators

CaveAgent introduces a novel framework that transforms LLMs into stateful runtime operators, enhancing their ability to manage complex ta...

arXiv - AI · 4 min · about 2 months ago

Llms

[2510.12121] Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing

This paper introduces a method for precise control of attribute intensities in Large Language Models (LLMs) through targeted representati...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2502.07274] Forget Forgetting: Continual Learning in a World of Abundant Memory

The paper explores continual learning (CL) in AI, proposing a shift from minimizing memory usage to leveraging abundant memory while addr...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2502.00213] Understanding Transformer Optimization via Gradient Heterogeneity

This paper explores the optimization challenges of Transformer models, focusing on gradient heterogeneity and its impact on convergence w...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2509.00074] Language and Experience: A Computational Model of Social Learning in Complex Tasks

This article presents a computational model that explores how humans and AI can integrate linguistic guidance and direct experience for e...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2409.04332] Amortized Bayesian Workflow

The paper presents an Amortized Bayesian Workflow that combines fast amortized inference with accurate MCMC techniques, optimizing Bayesi...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2411.04760] Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks

This paper presents novel domain adaptation methods for Spiking Neural Networks (SNNs) to address performance drops due to mismatched tem...

arXiv - Machine Learning · 4 min · about 2 months ago

Previous Page 125 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

Integrate Physical AI Capabilities into Existing Apps with NVIDIA Omniverse Libraries

UMKC Announces New Master of Science in Artificial Intelligence

Dell and HIVE partner to deploy Nvidia’s next-generation AI chips

All Content

[2510.18478] Safe But Not Sorry: Reducing Over-Conservatism in Safety Critics via Uncertainty-Aware Modulation

[2509.15194] Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

[2508.12907] SNAP-UQ: Self-supervised Next-Activation Prediction for Single-Pass Uncertainty in TinyML

[2501.16534] Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs

[2501.14406] Adaptive Rank Allocation for Federated Parameter-Efficient Fine-Tuning of Language Models

[2508.10480] Pinet: Optimizing hard-constrained neural networks with orthogonal projection layers

[2507.12257] Robust Causal Discovery in Real-World Time Series with Power-Laws

[2506.14202] DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation

[2602.11348] AgentNoiseBench: Benchmarking Robustness of Tool-Using LLM Agents Under Noisy Condition

[2505.19427] WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference

[2602.02050] Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents

[2504.06768] FedMerge: Federated Personalization via Model Merging

[2504.05615] FedEFC: Federated Learning Using Enhanced Forward Correction Against Noisy Labels

[2601.01569] CaveAgent: Transforming LLMs into Stateful Runtime Operators

[2510.12121] Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing

[2502.07274] Forget Forgetting: Continual Learning in a World of Abundant Memory

[2502.00213] Understanding Transformer Optimization via Gradient Heterogeneity

[2509.00074] Language and Experience: A Computational Model of Social Learning in Complex Tasks

[2409.04332] Amortized Bayesian Workflow

[2411.04760] Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks

Related Topics

Stay updated with AI News