AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · 16 minutes ago

Ai Infrastructure

[2603.28781] When GPUs Fail Quietly: Observability-Aware Early Warning Beyond Numeric Telemetry

Abstract page for arXiv paper 2603.28781: When GPUs Fail Quietly: Observability-Aware Early Warning Beyond Numeric Telemetry

arXiv - Machine Learning · 4 min · 25 minutes ago

Machine Learning

[2603.18104] Adaptive Domain Models: Bayesian Evolution, Warm Rotation, and Principled Training for Geometric and Neuromorphic AI

Abstract page for arXiv paper 2603.18104: Adaptive Domain Models: Bayesian Evolution, Warm Rotation, and Principled Training for Geometri...

arXiv - AI · 4 min · 25 minutes ago

All Content

Machine Learning

[2602.19143] Incremental Learning of Sparse Attention Patterns in Transformers

This paper explores how transformers learn through incremental acquisition of sparse attention patterns, revealing shifts in learning dyn...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.19142] Celo2: Towards Learned Optimization Free Lunch

The paper 'Celo2: Towards Learned Optimization Free Lunch' presents a novel learned optimizer that significantly reduces the computationa...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.19131] Test-Time Learning of Causal Structure from Interventional Data

The paper presents TICL, a novel method for causal structure learning from interventional data, enhancing generalization across diverse s...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.19126] Robust Predictive Uncertainty and Double Descent in Contaminated Bayesian Random Features

This paper presents a robust Bayesian approach to random feature regression, addressing prior and likelihood misspecification through Hub...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.18447] ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification

The paper presents ConfSpec, a novel framework for efficient step-level speculative reasoning in large language models, achieving signifi...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.18443] From "Help" to Helpful: A Hierarchical Assessment of LLMs in Mental e-Health Applications

This study evaluates the effectiveness of large language models (LLMs) in generating subject lines for mental health counseling emails, h...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.19066] IDLM: Inverse-distilled Diffusion Language Models

The paper presents Inverse-distilled Diffusion Language Models (IDLM), a method that significantly accelerates inference in text generati...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.19033] A Markovian View of Iterative-Feedback Loops in Image Generative Models: Neural Resonance and Model Collapse

This paper explores iterative feedback loops in image generative models, introducing the concept of neural resonance and its implications...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.19020] Learning to Detect Language Model Training Data via Active Reconstruction

This paper introduces the Active Data Reconstruction Attack (ADRA), a novel approach to detect language model training data by leveraging...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.19017] Why ReLU? A Bit-Model Dichotomy for Deep Network Training

This paper investigates the complexity of training deep neural networks under a realistic bit-level model, contrasting it with traditiona...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.20094] CausalFlip: A Benchmark for LLM Causal Judgment Beyond Semantic Matching

The paper introduces CausalFlip, a benchmark for evaluating large language models' (LLMs) causal reasoning capabilities, emphasizing the ...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.20031] Latent Introspection: Models Can Detect Prior Concept Injections

This article presents findings on the latent introspection abilities of the Qwen 32B model, showing its capacity to detect prior concept ...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.18934] LoMime: Query-Efficient Membership Inference using Model Extraction in Label-Only Settings

The paper presents LoMime, a novel framework for membership inference attacks that operates efficiently under label-only conditions, sign...

arXiv - Machine Learning · 4 min · about 1 month ago

Ai Infrastructure

[2602.18910] SLDP: Semi-Local Differential Privacy for Density-Adaptive Analytics

The paper introduces Semi-Local Differential Privacy (SLDP), a framework that enhances privacy-preserving analytics by decoupling privacy...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.19519] Ada-RS: Adaptive Rejection Sampling for Selective Thinking

The paper introduces Ada-RS, an adaptive rejection sampling framework aimed at enhancing selective thinking in large language models (LLM...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.18851] Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training

This paper presents a novel approach to stabilize low-precision training in transformer models by deriving rank-aware spectral bounds on ...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.19458] ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making

The paper presents ComplLLM, a framework for fine-tuning large language models (LLMs) to enhance decision-making by utilizing complementa...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.18825] Bayesian Lottery Ticket Hypothesis

The paper explores the Bayesian Lottery Ticket Hypothesis, demonstrating that sparse subnetworks in Bayesian neural networks can achieve ...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.18795] Vectorized Bayesian Inference for Latent Dirichlet-Tree Allocation

This paper presents a novel framework, Latent Dirichlet-Tree Allocation (LDTA), which enhances the traditional Latent Dirichlet Allocatio...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.19390] Artificial Intelligence for Modeling & Simulation in Digital Twins

This article explores the integration of artificial intelligence with modeling and simulation in digital twins, highlighting their roles ...

arXiv - AI · 4 min · about 1 month ago

Previous Page 101 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence

[2603.28781] When GPUs Fail Quietly: Observability-Aware Early Warning Beyond Numeric Telemetry

[2603.18104] Adaptive Domain Models: Bayesian Evolution, Warm Rotation, and Principled Training for Geometric and Neuromorphic AI

All Content

[2602.19143] Incremental Learning of Sparse Attention Patterns in Transformers

[2602.19142] Celo2: Towards Learned Optimization Free Lunch

[2602.19131] Test-Time Learning of Causal Structure from Interventional Data

[2602.19126] Robust Predictive Uncertainty and Double Descent in Contaminated Bayesian Random Features

[2602.18447] ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification

[2602.18443] From "Help" to Helpful: A Hierarchical Assessment of LLMs in Mental e-Health Applications

[2602.19066] IDLM: Inverse-distilled Diffusion Language Models

[2602.19033] A Markovian View of Iterative-Feedback Loops in Image Generative Models: Neural Resonance and Model Collapse

[2602.19020] Learning to Detect Language Model Training Data via Active Reconstruction

[2602.19017] Why ReLU? A Bit-Model Dichotomy for Deep Network Training

[2602.20094] CausalFlip: A Benchmark for LLM Causal Judgment Beyond Semantic Matching

[2602.20031] Latent Introspection: Models Can Detect Prior Concept Injections

[2602.18934] LoMime: Query-Efficient Membership Inference using Model Extraction in Label-Only Settings

[2602.18910] SLDP: Semi-Local Differential Privacy for Density-Adaptive Analytics

[2602.19519] Ada-RS: Adaptive Rejection Sampling for Selective Thinking

[2602.18851] Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training

[2602.19458] ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making

[2602.18825] Bayesian Lottery Ticket Hypothesis

[2602.18795] Vectorized Bayesian Inference for Latent Dirichlet-Tree Allocation

[2602.19390] Artificial Intelligence for Modeling & Simulation in Digital Twins

Related Topics

Stay updated with AI News