AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 6 hours ago

Ai Infrastructure

Siemens, NVIDIA hit chip verification milestone for AI

AI News - General · about 11 hours ago

Ai Infrastructure

Dell and HIVE partner to deploy Nvidia’s next-generation AI chips

AI News - General · 1 min · about 11 hours ago

All Content

Llms

[2501.16534] Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs

This article presents a novel technique for extracting safety classifiers from aligned large language models (LLMs) to address vulnerabil...

arXiv - AI · 4 min · about 2 months ago

Llms

[2501.14406] Adaptive Rank Allocation for Federated Parameter-Efficient Fine-Tuning of Language Models

The paper presents FedARA, an innovative framework for federated parameter-efficient fine-tuning of language models, addressing data hete...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2508.10480] Pinet: Optimizing hard-constrained neural networks with orthogonal projection layers

The paper introduces $ ext{Pinet}$, a novel output layer for neural networks that optimizes hard constraints using orthogonal projection ...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2507.12257] Robust Causal Discovery in Real-World Time Series with Power-Laws

This paper presents a novel method for causal discovery in time series data, leveraging power-law distributions to enhance robustness aga...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2506.14202] DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation

The paper introduces DiffusionBlocks, a framework for block-wise training of neural networks that reduces memory bottlenecks while mainta...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.11348] AgentNoiseBench: Benchmarking Robustness of Tool-Using LLM Agents Under Noisy Condition

The paper introduces AgentNoiseBench, a framework for evaluating the robustness of tool-using LLM agents under noisy conditions, highligh...

arXiv - AI · 4 min · about 2 months ago

Llms

[2505.19427] WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference

The paper introduces WINA, a novel framework for efficient inference in large language models (LLMs) that optimally combines hidden state...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.02050] Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents

This article explores the role of entropy in optimizing tool-use behaviors for Large Language Model (LLM) agents, highlighting the correl...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2504.06768] FedMerge: Federated Personalization via Model Merging

The paper introduces FedMerge, a novel approach in federated learning that enables personalized model creation for clients by merging mul...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2504.05615] FedEFC: Federated Learning Using Enhanced Forward Correction Against Noisy Labels

The paper presents FedEFC, a novel approach to federated learning that addresses the challenges posed by noisy labels through techniques ...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2601.01569] CaveAgent: Transforming LLMs into Stateful Runtime Operators

CaveAgent introduces a novel framework that transforms LLMs into stateful runtime operators, enhancing their ability to manage complex ta...

arXiv - AI · 4 min · about 2 months ago

Llms

[2510.12121] Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing

This paper introduces a method for precise control of attribute intensities in Large Language Models (LLMs) through targeted representati...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2502.07274] Forget Forgetting: Continual Learning in a World of Abundant Memory

The paper explores continual learning (CL) in AI, proposing a shift from minimizing memory usage to leveraging abundant memory while addr...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2502.00213] Understanding Transformer Optimization via Gradient Heterogeneity

This paper explores the optimization challenges of Transformer models, focusing on gradient heterogeneity and its impact on convergence w...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2509.00074] Language and Experience: A Computational Model of Social Learning in Complex Tasks

This article presents a computational model that explores how humans and AI can integrate linguistic guidance and direct experience for e...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2409.04332] Amortized Bayesian Workflow

The paper presents an Amortized Bayesian Workflow that combines fast amortized inference with accurate MCMC techniques, optimizing Bayesi...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2411.04760] Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks

This paper presents novel domain adaptation methods for Spiking Neural Networks (SNNs) to address performance drops due to mismatched tem...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.16690] Synthetic-Powered Multiple Testing with FDR Control

The paper presents SynthBH, a novel method for multiple hypothesis testing that integrates synthetic data to enhance statistical inferenc...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2502.01160] Scalable Precise Computation of Shannon Entropy

This paper presents a scalable tool, PSE, for precise computation of Shannon entropy, optimizing the process to enhance efficiency in qua...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.16660] Align Once, Benefit Multilingually: Enforcing Multilingual Consistency for LLM Safety Alignment

The paper presents a method for enhancing multilingual safety alignment in large language models (LLMs) using a resource-efficient Multi-...

arXiv - Machine Learning · 4 min · about 2 months ago

Previous Page 130 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence

Siemens, NVIDIA hit chip verification milestone for AI

Dell and HIVE partner to deploy Nvidia’s next-generation AI chips

All Content

[2501.16534] Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs

[2501.14406] Adaptive Rank Allocation for Federated Parameter-Efficient Fine-Tuning of Language Models

[2508.10480] Pinet: Optimizing hard-constrained neural networks with orthogonal projection layers

[2507.12257] Robust Causal Discovery in Real-World Time Series with Power-Laws

[2506.14202] DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation

[2602.11348] AgentNoiseBench: Benchmarking Robustness of Tool-Using LLM Agents Under Noisy Condition

[2505.19427] WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference

[2602.02050] Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents

[2504.06768] FedMerge: Federated Personalization via Model Merging

[2504.05615] FedEFC: Federated Learning Using Enhanced Forward Correction Against Noisy Labels

[2601.01569] CaveAgent: Transforming LLMs into Stateful Runtime Operators

[2510.12121] Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing

[2502.07274] Forget Forgetting: Continual Learning in a World of Abundant Memory

[2502.00213] Understanding Transformer Optimization via Gradient Heterogeneity

[2509.00074] Language and Experience: A Computational Model of Social Learning in Complex Tasks

[2409.04332] Amortized Bayesian Workflow

[2411.04760] Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks

[2602.16690] Synthetic-Powered Multiple Testing with FDR Control

[2502.01160] Scalable Precise Computation of Shannon Entropy

[2602.16660] Align Once, Benefit Multilingually: Enforcing Multilingual Consistency for LLM Safety Alignment

Related Topics

Stay updated with AI News