AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 4 hours ago

Llms

[2603.10047] Toward Epistemic Stability: Engineering Consistent Procedures for Industrial LLM Hallucination Reduction

Abstract page for arXiv paper 2603.10047: Toward Epistemic Stability: Engineering Consistent Procedures for Industrial LLM Hallucination ...

arXiv - AI · 4 min · about 5 hours ago

Machine Learning

[2512.18388] Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creation with Generative Models

Abstract page for arXiv paper 2512.18388: Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creatio...

arXiv - AI · 4 min · about 5 hours ago

All Content

Llms

[2602.18443] From "Help" to Helpful: A Hierarchical Assessment of LLMs in Mental e-Health Applications

This study evaluates the effectiveness of large language models (LLMs) in generating subject lines for mental health counseling emails, h...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.19066] IDLM: Inverse-distilled Diffusion Language Models

The paper presents Inverse-distilled Diffusion Language Models (IDLM), a method that significantly accelerates inference in text generati...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.19033] A Markovian View of Iterative-Feedback Loops in Image Generative Models: Neural Resonance and Model Collapse

This paper explores iterative feedback loops in image generative models, introducing the concept of neural resonance and its implications...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.19020] Learning to Detect Language Model Training Data via Active Reconstruction

This paper introduces the Active Data Reconstruction Attack (ADRA), a novel approach to detect language model training data by leveraging...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.19017] Why ReLU? A Bit-Model Dichotomy for Deep Network Training

This paper investigates the complexity of training deep neural networks under a realistic bit-level model, contrasting it with traditiona...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.20094] CausalFlip: A Benchmark for LLM Causal Judgment Beyond Semantic Matching

The paper introduces CausalFlip, a benchmark for evaluating large language models' (LLMs) causal reasoning capabilities, emphasizing the ...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.20031] Latent Introspection: Models Can Detect Prior Concept Injections

This article presents findings on the latent introspection abilities of the Qwen 32B model, showing its capacity to detect prior concept ...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.18934] LoMime: Query-Efficient Membership Inference using Model Extraction in Label-Only Settings

The paper presents LoMime, a novel framework for membership inference attacks that operates efficiently under label-only conditions, sign...

arXiv - Machine Learning · 4 min · about 1 month ago

Ai Infrastructure

[2602.18910] SLDP: Semi-Local Differential Privacy for Density-Adaptive Analytics

The paper introduces Semi-Local Differential Privacy (SLDP), a framework that enhances privacy-preserving analytics by decoupling privacy...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.19519] Ada-RS: Adaptive Rejection Sampling for Selective Thinking

The paper introduces Ada-RS, an adaptive rejection sampling framework aimed at enhancing selective thinking in large language models (LLM...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.18851] Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training

This paper presents a novel approach to stabilize low-precision training in transformer models by deriving rank-aware spectral bounds on ...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.19458] ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making

The paper presents ComplLLM, a framework for fine-tuning large language models (LLMs) to enhance decision-making by utilizing complementa...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.18825] Bayesian Lottery Ticket Hypothesis

The paper explores the Bayesian Lottery Ticket Hypothesis, demonstrating that sparse subnetworks in Bayesian neural networks can achieve ...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.18795] Vectorized Bayesian Inference for Latent Dirichlet-Tree Allocation

This paper presents a novel framework, Latent Dirichlet-Tree Allocation (LDTA), which enhances the traditional Latent Dirichlet Allocatio...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.19390] Artificial Intelligence for Modeling & Simulation in Digital Twins

This article explores the integration of artificial intelligence with modeling and simulation in digital twins, highlighting their roles ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.18733] Prior Aware Memorization: An Efficient Metric for Distinguishing Memorization from Generalization in Large Language Models

The paper introduces Prior Aware Memorization, a new metric for distinguishing genuine memorization from generalization in large language...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.18658] Communication-Efficient Personalized Adaptation via Federated-Local Model Merging

The paper presents Potara, a framework for federated personalization that merges general and personalized models, improving efficiency an...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.19128] K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model

The paper presents K-Search, a novel framework for optimizing GPU kernels using a co-evolving intrinsic world model, significantly improv...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.18647] Information-Guided Noise Allocation for Efficient Diffusion Training

The paper presents InfoNoise, a data-adaptive noise scheduling method for diffusion training, enhancing efficiency and performance by uti...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.18645] Adaptive Time Series Reasoning via Segment Selection

The paper presents ARTIST, a novel approach to time series reasoning that utilizes adaptive segment selection to improve accuracy in answ...

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 104 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence

[2603.10047] Toward Epistemic Stability: Engineering Consistent Procedures for Industrial LLM Hallucination Reduction

[2512.18388] Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creation with Generative Models

All Content

[2602.18443] From "Help" to Helpful: A Hierarchical Assessment of LLMs in Mental e-Health Applications

[2602.19066] IDLM: Inverse-distilled Diffusion Language Models

[2602.19033] A Markovian View of Iterative-Feedback Loops in Image Generative Models: Neural Resonance and Model Collapse

[2602.19020] Learning to Detect Language Model Training Data via Active Reconstruction

[2602.19017] Why ReLU? A Bit-Model Dichotomy for Deep Network Training

[2602.20094] CausalFlip: A Benchmark for LLM Causal Judgment Beyond Semantic Matching

[2602.20031] Latent Introspection: Models Can Detect Prior Concept Injections

[2602.18934] LoMime: Query-Efficient Membership Inference using Model Extraction in Label-Only Settings

[2602.18910] SLDP: Semi-Local Differential Privacy for Density-Adaptive Analytics

[2602.19519] Ada-RS: Adaptive Rejection Sampling for Selective Thinking

[2602.18851] Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training

[2602.19458] ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making

[2602.18825] Bayesian Lottery Ticket Hypothesis

[2602.18795] Vectorized Bayesian Inference for Latent Dirichlet-Tree Allocation

[2602.19390] Artificial Intelligence for Modeling & Simulation in Digital Twins

[2602.18733] Prior Aware Memorization: An Efficient Metric for Distinguishing Memorization from Generalization in Large Language Models

[2602.18658] Communication-Efficient Personalized Adaptation via Federated-Local Model Merging

[2602.19128] K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model

[2602.18647] Information-Guided Noise Allocation for Efficient Diffusion Training

[2602.18645] Adaptive Time Series Reasoning via Segment Selection

Related Topics

Stay updated with AI News