AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

Integrate Physical AI Capabilities into Existing Apps with NVIDIA Omniverse Libraries
Ai Infrastructure

Integrate Physical AI Capabilities into Existing Apps with NVIDIA Omniverse Libraries

Physical AI—AI systems that perceive, reason, and act in physically grounded simulated environments—is changing how teams design and vali...

AI Tools & Products · 14 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Ai Infrastructure

Dell and HIVE partner to deploy Nvidia’s next-generation AI chips

AI News - General · 1 min ·

All Content

[2510.18478] Safe But Not Sorry: Reducing Over-Conservatism in Safety Critics via Uncertainty-Aware Modulation
Ai Infrastructure

[2510.18478] Safe But Not Sorry: Reducing Over-Conservatism in Safety Critics via Uncertainty-Aware Modulation

This article presents the Uncertain Safety Critic (USC), a novel approach to enhance safety in reinforcement learning (RL) by balancing s...

arXiv - Machine Learning · 3 min ·
[2509.15194] Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation
Llms

[2509.15194] Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

The paper presents EVOL-RL, a novel framework for evolving language models without labels, balancing majority-driven stability with novel...

arXiv - Machine Learning · 4 min ·
[2508.12907] SNAP-UQ: Self-supervised Next-Activation Prediction for Single-Pass Uncertainty in TinyML
Machine Learning

[2508.12907] SNAP-UQ: Self-supervised Next-Activation Prediction for Single-Pass Uncertainty in TinyML

The paper presents SNAP-UQ, a novel method for single-pass uncertainty estimation in TinyML, enhancing reliability in on-device monitorin...

arXiv - Machine Learning · 4 min ·
[2501.16534] Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs
Llms

[2501.16534] Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs

This article presents a novel technique for extracting safety classifiers from aligned large language models (LLMs) to address vulnerabil...

arXiv - AI · 4 min ·
[2501.14406] Adaptive Rank Allocation for Federated Parameter-Efficient Fine-Tuning of Language Models
Llms

[2501.14406] Adaptive Rank Allocation for Federated Parameter-Efficient Fine-Tuning of Language Models

The paper presents FedARA, an innovative framework for federated parameter-efficient fine-tuning of language models, addressing data hete...

arXiv - Machine Learning · 4 min ·
[2508.10480] Pinet: Optimizing hard-constrained neural networks with orthogonal projection layers
Machine Learning

[2508.10480] Pinet: Optimizing hard-constrained neural networks with orthogonal projection layers

The paper introduces $ ext{Pinet}$, a novel output layer for neural networks that optimizes hard constraints using orthogonal projection ...

arXiv - Machine Learning · 3 min ·
[2507.12257] Robust Causal Discovery in Real-World Time Series with Power-Laws
Machine Learning

[2507.12257] Robust Causal Discovery in Real-World Time Series with Power-Laws

This paper presents a novel method for causal discovery in time series data, leveraging power-law distributions to enhance robustness aga...

arXiv - Machine Learning · 3 min ·
[2506.14202] DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation
Machine Learning

[2506.14202] DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation

The paper introduces DiffusionBlocks, a framework for block-wise training of neural networks that reduces memory bottlenecks while mainta...

arXiv - Machine Learning · 4 min ·
[2602.11348] AgentNoiseBench: Benchmarking Robustness of Tool-Using LLM Agents Under Noisy Condition
Llms

[2602.11348] AgentNoiseBench: Benchmarking Robustness of Tool-Using LLM Agents Under Noisy Condition

The paper introduces AgentNoiseBench, a framework for evaluating the robustness of tool-using LLM agents under noisy conditions, highligh...

arXiv - AI · 4 min ·
[2505.19427] WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference
Llms

[2505.19427] WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference

The paper introduces WINA, a novel framework for efficient inference in large language models (LLMs) that optimally combines hidden state...

arXiv - Machine Learning · 4 min ·
[2602.02050] Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents
Llms

[2602.02050] Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents

This article explores the role of entropy in optimizing tool-use behaviors for Large Language Model (LLM) agents, highlighting the correl...

arXiv - AI · 4 min ·
[2504.06768] FedMerge: Federated Personalization via Model Merging
Machine Learning

[2504.06768] FedMerge: Federated Personalization via Model Merging

The paper introduces FedMerge, a novel approach in federated learning that enables personalized model creation for clients by merging mul...

arXiv - Machine Learning · 4 min ·
[2504.05615] FedEFC: Federated Learning Using Enhanced Forward Correction Against Noisy Labels
Machine Learning

[2504.05615] FedEFC: Federated Learning Using Enhanced Forward Correction Against Noisy Labels

The paper presents FedEFC, a novel approach to federated learning that addresses the challenges posed by noisy labels through techniques ...

arXiv - Machine Learning · 4 min ·
[2601.01569] CaveAgent: Transforming LLMs into Stateful Runtime Operators
Llms

[2601.01569] CaveAgent: Transforming LLMs into Stateful Runtime Operators

CaveAgent introduces a novel framework that transforms LLMs into stateful runtime operators, enhancing their ability to manage complex ta...

arXiv - AI · 4 min ·
[2510.12121] Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing
Llms

[2510.12121] Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing

This paper introduces a method for precise control of attribute intensities in Large Language Models (LLMs) through targeted representati...

arXiv - Machine Learning · 4 min ·
[2502.07274] Forget Forgetting: Continual Learning in a World of Abundant Memory
Machine Learning

[2502.07274] Forget Forgetting: Continual Learning in a World of Abundant Memory

The paper explores continual learning (CL) in AI, proposing a shift from minimizing memory usage to leveraging abundant memory while addr...

arXiv - Machine Learning · 4 min ·
[2502.00213] Understanding Transformer Optimization via Gradient Heterogeneity
Machine Learning

[2502.00213] Understanding Transformer Optimization via Gradient Heterogeneity

This paper explores the optimization challenges of Transformer models, focusing on gradient heterogeneity and its impact on convergence w...

arXiv - Machine Learning · 4 min ·
[2509.00074] Language and Experience: A Computational Model of Social Learning in Complex Tasks
Machine Learning

[2509.00074] Language and Experience: A Computational Model of Social Learning in Complex Tasks

This article presents a computational model that explores how humans and AI can integrate linguistic guidance and direct experience for e...

arXiv - Machine Learning · 4 min ·
[2409.04332] Amortized Bayesian Workflow
Machine Learning

[2409.04332] Amortized Bayesian Workflow

The paper presents an Amortized Bayesian Workflow that combines fast amortized inference with accurate MCMC techniques, optimizing Bayesi...

arXiv - Machine Learning · 3 min ·
[2411.04760] Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks
Machine Learning

[2411.04760] Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks

This paper presents novel domain adaptation methods for Spiking Neural Networks (SNNs) to address performance drops due to mismatched tem...

arXiv - Machine Learning · 4 min ·
Previous Page 125 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime