AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Ai Infrastructure

Siemens, NVIDIA hit chip verification milestone for AI

AI News - General ·
Ai Infrastructure

Dell and HIVE partner to deploy Nvidia’s next-generation AI chips

AI News - General · 1 min ·

All Content

[2501.16534] Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs
Llms

[2501.16534] Targeting Alignment: Extracting Safety Classifiers of Aligned LLMs

This article presents a novel technique for extracting safety classifiers from aligned large language models (LLMs) to address vulnerabil...

arXiv - AI · 4 min ·
[2501.14406] Adaptive Rank Allocation for Federated Parameter-Efficient Fine-Tuning of Language Models
Llms

[2501.14406] Adaptive Rank Allocation for Federated Parameter-Efficient Fine-Tuning of Language Models

The paper presents FedARA, an innovative framework for federated parameter-efficient fine-tuning of language models, addressing data hete...

arXiv - Machine Learning · 4 min ·
[2508.10480] Pinet: Optimizing hard-constrained neural networks with orthogonal projection layers
Machine Learning

[2508.10480] Pinet: Optimizing hard-constrained neural networks with orthogonal projection layers

The paper introduces $ ext{Pinet}$, a novel output layer for neural networks that optimizes hard constraints using orthogonal projection ...

arXiv - Machine Learning · 3 min ·
[2507.12257] Robust Causal Discovery in Real-World Time Series with Power-Laws
Machine Learning

[2507.12257] Robust Causal Discovery in Real-World Time Series with Power-Laws

This paper presents a novel method for causal discovery in time series data, leveraging power-law distributions to enhance robustness aga...

arXiv - Machine Learning · 3 min ·
[2506.14202] DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation
Machine Learning

[2506.14202] DiffusionBlocks: Block-wise Neural Network Training via Diffusion Interpretation

The paper introduces DiffusionBlocks, a framework for block-wise training of neural networks that reduces memory bottlenecks while mainta...

arXiv - Machine Learning · 4 min ·
[2602.11348] AgentNoiseBench: Benchmarking Robustness of Tool-Using LLM Agents Under Noisy Condition
Llms

[2602.11348] AgentNoiseBench: Benchmarking Robustness of Tool-Using LLM Agents Under Noisy Condition

The paper introduces AgentNoiseBench, a framework for evaluating the robustness of tool-using LLM agents under noisy conditions, highligh...

arXiv - AI · 4 min ·
[2505.19427] WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference
Llms

[2505.19427] WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference

The paper introduces WINA, a novel framework for efficient inference in large language models (LLMs) that optimally combines hidden state...

arXiv - Machine Learning · 4 min ·
[2602.02050] Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents
Llms

[2602.02050] Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents

This article explores the role of entropy in optimizing tool-use behaviors for Large Language Model (LLM) agents, highlighting the correl...

arXiv - AI · 4 min ·
[2504.06768] FedMerge: Federated Personalization via Model Merging
Machine Learning

[2504.06768] FedMerge: Federated Personalization via Model Merging

The paper introduces FedMerge, a novel approach in federated learning that enables personalized model creation for clients by merging mul...

arXiv - Machine Learning · 4 min ·
[2504.05615] FedEFC: Federated Learning Using Enhanced Forward Correction Against Noisy Labels
Machine Learning

[2504.05615] FedEFC: Federated Learning Using Enhanced Forward Correction Against Noisy Labels

The paper presents FedEFC, a novel approach to federated learning that addresses the challenges posed by noisy labels through techniques ...

arXiv - Machine Learning · 4 min ·
[2601.01569] CaveAgent: Transforming LLMs into Stateful Runtime Operators
Llms

[2601.01569] CaveAgent: Transforming LLMs into Stateful Runtime Operators

CaveAgent introduces a novel framework that transforms LLMs into stateful runtime operators, enhancing their ability to manage complex ta...

arXiv - AI · 4 min ·
[2510.12121] Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing
Llms

[2510.12121] Precise Attribute Intensity Control in Large Language Models via Targeted Representation Editing

This paper introduces a method for precise control of attribute intensities in Large Language Models (LLMs) through targeted representati...

arXiv - Machine Learning · 4 min ·
[2502.07274] Forget Forgetting: Continual Learning in a World of Abundant Memory
Machine Learning

[2502.07274] Forget Forgetting: Continual Learning in a World of Abundant Memory

The paper explores continual learning (CL) in AI, proposing a shift from minimizing memory usage to leveraging abundant memory while addr...

arXiv - Machine Learning · 4 min ·
[2502.00213] Understanding Transformer Optimization via Gradient Heterogeneity
Machine Learning

[2502.00213] Understanding Transformer Optimization via Gradient Heterogeneity

This paper explores the optimization challenges of Transformer models, focusing on gradient heterogeneity and its impact on convergence w...

arXiv - Machine Learning · 4 min ·
[2509.00074] Language and Experience: A Computational Model of Social Learning in Complex Tasks
Machine Learning

[2509.00074] Language and Experience: A Computational Model of Social Learning in Complex Tasks

This article presents a computational model that explores how humans and AI can integrate linguistic guidance and direct experience for e...

arXiv - Machine Learning · 4 min ·
[2409.04332] Amortized Bayesian Workflow
Machine Learning

[2409.04332] Amortized Bayesian Workflow

The paper presents an Amortized Bayesian Workflow that combines fast amortized inference with accurate MCMC techniques, optimizing Bayesi...

arXiv - Machine Learning · 3 min ·
[2411.04760] Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks
Machine Learning

[2411.04760] Zero-Shot Temporal Resolution Domain Adaptation for Spiking Neural Networks

This paper presents novel domain adaptation methods for Spiking Neural Networks (SNNs) to address performance drops due to mismatched tem...

arXiv - Machine Learning · 4 min ·
[2602.16690] Synthetic-Powered Multiple Testing with FDR Control
Machine Learning

[2602.16690] Synthetic-Powered Multiple Testing with FDR Control

The paper presents SynthBH, a novel method for multiple hypothesis testing that integrates synthetic data to enhance statistical inferenc...

arXiv - Machine Learning · 3 min ·
[2502.01160] Scalable Precise Computation of Shannon Entropy
Machine Learning

[2502.01160] Scalable Precise Computation of Shannon Entropy

This paper presents a scalable tool, PSE, for precise computation of Shannon entropy, optimizing the process to enhance efficiency in qua...

arXiv - AI · 4 min ·
[2602.16660] Align Once, Benefit Multilingually: Enforcing Multilingual Consistency for LLM Safety Alignment
Llms

[2602.16660] Align Once, Benefit Multilingually: Enforcing Multilingual Consistency for LLM Safety Alignment

The paper presents a method for enhancing multilingual safety alignment in large language models (LLMs) using a resource-efficient Multi-...

arXiv - Machine Learning · 4 min ·
Previous Page 130 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime