AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
[2603.28781] When GPUs Fail Quietly: Observability-Aware Early Warning Beyond Numeric Telemetry
Ai Infrastructure

[2603.28781] When GPUs Fail Quietly: Observability-Aware Early Warning Beyond Numeric Telemetry

Abstract page for arXiv paper 2603.28781: When GPUs Fail Quietly: Observability-Aware Early Warning Beyond Numeric Telemetry

arXiv - Machine Learning · 4 min ·
[2603.18104] Adaptive Domain Models: Bayesian Evolution, Warm Rotation, and Principled Training for Geometric and Neuromorphic AI
Machine Learning

[2603.18104] Adaptive Domain Models: Bayesian Evolution, Warm Rotation, and Principled Training for Geometric and Neuromorphic AI

Abstract page for arXiv paper 2603.18104: Adaptive Domain Models: Bayesian Evolution, Warm Rotation, and Principled Training for Geometri...

arXiv - AI · 4 min ·

All Content

[2602.19143] Incremental Learning of Sparse Attention Patterns in Transformers
Machine Learning

[2602.19143] Incremental Learning of Sparse Attention Patterns in Transformers

This paper explores how transformers learn through incremental acquisition of sparse attention patterns, revealing shifts in learning dyn...

arXiv - Machine Learning · 3 min ·
[2602.19142] Celo2: Towards Learned Optimization Free Lunch
Llms

[2602.19142] Celo2: Towards Learned Optimization Free Lunch

The paper 'Celo2: Towards Learned Optimization Free Lunch' presents a novel learned optimizer that significantly reduces the computationa...

arXiv - AI · 3 min ·
[2602.19131] Test-Time Learning of Causal Structure from Interventional Data
Machine Learning

[2602.19131] Test-Time Learning of Causal Structure from Interventional Data

The paper presents TICL, a novel method for causal structure learning from interventional data, enhancing generalization across diverse s...

arXiv - AI · 3 min ·
[2602.19126] Robust Predictive Uncertainty and Double Descent in Contaminated Bayesian Random Features
Machine Learning

[2602.19126] Robust Predictive Uncertainty and Double Descent in Contaminated Bayesian Random Features

This paper presents a robust Bayesian approach to random feature regression, addressing prior and likelihood misspecification through Hub...

arXiv - Machine Learning · 4 min ·
[2602.18447] ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification
Llms

[2602.18447] ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification

The paper presents ConfSpec, a novel framework for efficient step-level speculative reasoning in large language models, achieving signifi...

arXiv - AI · 3 min ·
[2602.18443] From "Help" to Helpful: A Hierarchical Assessment of LLMs in Mental e-Health Applications
Llms

[2602.18443] From "Help" to Helpful: A Hierarchical Assessment of LLMs in Mental e-Health Applications

This study evaluates the effectiveness of large language models (LLMs) in generating subject lines for mental health counseling emails, h...

arXiv - AI · 3 min ·
[2602.19066] IDLM: Inverse-distilled Diffusion Language Models
Llms

[2602.19066] IDLM: Inverse-distilled Diffusion Language Models

The paper presents Inverse-distilled Diffusion Language Models (IDLM), a method that significantly accelerates inference in text generati...

arXiv - AI · 3 min ·
[2602.19033] A Markovian View of Iterative-Feedback Loops in Image Generative Models: Neural Resonance and Model Collapse
Machine Learning

[2602.19033] A Markovian View of Iterative-Feedback Loops in Image Generative Models: Neural Resonance and Model Collapse

This paper explores iterative feedback loops in image generative models, introducing the concept of neural resonance and its implications...

arXiv - AI · 4 min ·
[2602.19020] Learning to Detect Language Model Training Data via Active Reconstruction
Llms

[2602.19020] Learning to Detect Language Model Training Data via Active Reconstruction

This paper introduces the Active Data Reconstruction Attack (ADRA), a novel approach to detect language model training data by leveraging...

arXiv - AI · 4 min ·
[2602.19017] Why ReLU? A Bit-Model Dichotomy for Deep Network Training
Machine Learning

[2602.19017] Why ReLU? A Bit-Model Dichotomy for Deep Network Training

This paper investigates the complexity of training deep neural networks under a realistic bit-level model, contrasting it with traditiona...

arXiv - Machine Learning · 4 min ·
[2602.20094] CausalFlip: A Benchmark for LLM Causal Judgment Beyond Semantic Matching
Llms

[2602.20094] CausalFlip: A Benchmark for LLM Causal Judgment Beyond Semantic Matching

The paper introduces CausalFlip, a benchmark for evaluating large language models' (LLMs) causal reasoning capabilities, emphasizing the ...

arXiv - AI · 4 min ·
[2602.20031] Latent Introspection: Models Can Detect Prior Concept Injections
Machine Learning

[2602.20031] Latent Introspection: Models Can Detect Prior Concept Injections

This article presents findings on the latent introspection abilities of the Qwen 32B model, showing its capacity to detect prior concept ...

arXiv - Machine Learning · 3 min ·
[2602.18934] LoMime: Query-Efficient Membership Inference using Model Extraction in Label-Only Settings
Machine Learning

[2602.18934] LoMime: Query-Efficient Membership Inference using Model Extraction in Label-Only Settings

The paper presents LoMime, a novel framework for membership inference attacks that operates efficiently under label-only conditions, sign...

arXiv - Machine Learning · 4 min ·
[2602.18910] SLDP: Semi-Local Differential Privacy for Density-Adaptive Analytics
Ai Infrastructure

[2602.18910] SLDP: Semi-Local Differential Privacy for Density-Adaptive Analytics

The paper introduces Semi-Local Differential Privacy (SLDP), a framework that enhances privacy-preserving analytics by decoupling privacy...

arXiv - Machine Learning · 3 min ·
[2602.19519] Ada-RS: Adaptive Rejection Sampling for Selective Thinking
Llms

[2602.19519] Ada-RS: Adaptive Rejection Sampling for Selective Thinking

The paper introduces Ada-RS, an adaptive rejection sampling framework aimed at enhancing selective thinking in large language models (LLM...

arXiv - Machine Learning · 3 min ·
[2602.18851] Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training
Machine Learning

[2602.18851] Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training

This paper presents a novel approach to stabilize low-precision training in transformer models by deriving rank-aware spectral bounds on ...

arXiv - AI · 3 min ·
[2602.19458] ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making
Llms

[2602.19458] ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making

The paper presents ComplLLM, a framework for fine-tuning large language models (LLMs) to enhance decision-making by utilizing complementa...

arXiv - AI · 3 min ·
[2602.18825] Bayesian Lottery Ticket Hypothesis
Machine Learning

[2602.18825] Bayesian Lottery Ticket Hypothesis

The paper explores the Bayesian Lottery Ticket Hypothesis, demonstrating that sparse subnetworks in Bayesian neural networks can achieve ...

arXiv - Machine Learning · 4 min ·
[2602.18795] Vectorized Bayesian Inference for Latent Dirichlet-Tree Allocation
Machine Learning

[2602.18795] Vectorized Bayesian Inference for Latent Dirichlet-Tree Allocation

This paper presents a novel framework, Latent Dirichlet-Tree Allocation (LDTA), which enhances the traditional Latent Dirichlet Allocatio...

arXiv - Machine Learning · 3 min ·
[2602.19390] Artificial Intelligence for Modeling & Simulation in Digital Twins
Machine Learning

[2602.19390] Artificial Intelligence for Modeling & Simulation in Digital Twins

This article explores the integration of artificial intelligence with modeling and simulation in digital twins, highlighting their roles ...

arXiv - AI · 4 min ·
Previous Page 101 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime