AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
[2603.10047] Toward Epistemic Stability: Engineering Consistent Procedures for Industrial LLM Hallucination Reduction
Llms

[2603.10047] Toward Epistemic Stability: Engineering Consistent Procedures for Industrial LLM Hallucination Reduction

Abstract page for arXiv paper 2603.10047: Toward Epistemic Stability: Engineering Consistent Procedures for Industrial LLM Hallucination ...

arXiv - AI · 4 min ·
[2512.18388] Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creation with Generative Models
Machine Learning

[2512.18388] Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creation with Generative Models

Abstract page for arXiv paper 2512.18388: Exploration vs. Fixation: Scaffolding Divergent and Convergent Thinking for Human-AI Co-Creatio...

arXiv - AI · 4 min ·

All Content

[2602.18443] From "Help" to Helpful: A Hierarchical Assessment of LLMs in Mental e-Health Applications
Llms

[2602.18443] From "Help" to Helpful: A Hierarchical Assessment of LLMs in Mental e-Health Applications

This study evaluates the effectiveness of large language models (LLMs) in generating subject lines for mental health counseling emails, h...

arXiv - AI · 3 min ·
[2602.19066] IDLM: Inverse-distilled Diffusion Language Models
Llms

[2602.19066] IDLM: Inverse-distilled Diffusion Language Models

The paper presents Inverse-distilled Diffusion Language Models (IDLM), a method that significantly accelerates inference in text generati...

arXiv - AI · 3 min ·
[2602.19033] A Markovian View of Iterative-Feedback Loops in Image Generative Models: Neural Resonance and Model Collapse
Machine Learning

[2602.19033] A Markovian View of Iterative-Feedback Loops in Image Generative Models: Neural Resonance and Model Collapse

This paper explores iterative feedback loops in image generative models, introducing the concept of neural resonance and its implications...

arXiv - AI · 4 min ·
[2602.19020] Learning to Detect Language Model Training Data via Active Reconstruction
Llms

[2602.19020] Learning to Detect Language Model Training Data via Active Reconstruction

This paper introduces the Active Data Reconstruction Attack (ADRA), a novel approach to detect language model training data by leveraging...

arXiv - AI · 4 min ·
[2602.19017] Why ReLU? A Bit-Model Dichotomy for Deep Network Training
Machine Learning

[2602.19017] Why ReLU? A Bit-Model Dichotomy for Deep Network Training

This paper investigates the complexity of training deep neural networks under a realistic bit-level model, contrasting it with traditiona...

arXiv - Machine Learning · 4 min ·
[2602.20094] CausalFlip: A Benchmark for LLM Causal Judgment Beyond Semantic Matching
Llms

[2602.20094] CausalFlip: A Benchmark for LLM Causal Judgment Beyond Semantic Matching

The paper introduces CausalFlip, a benchmark for evaluating large language models' (LLMs) causal reasoning capabilities, emphasizing the ...

arXiv - AI · 4 min ·
[2602.20031] Latent Introspection: Models Can Detect Prior Concept Injections
Machine Learning

[2602.20031] Latent Introspection: Models Can Detect Prior Concept Injections

This article presents findings on the latent introspection abilities of the Qwen 32B model, showing its capacity to detect prior concept ...

arXiv - Machine Learning · 3 min ·
[2602.18934] LoMime: Query-Efficient Membership Inference using Model Extraction in Label-Only Settings
Machine Learning

[2602.18934] LoMime: Query-Efficient Membership Inference using Model Extraction in Label-Only Settings

The paper presents LoMime, a novel framework for membership inference attacks that operates efficiently under label-only conditions, sign...

arXiv - Machine Learning · 4 min ·
[2602.18910] SLDP: Semi-Local Differential Privacy for Density-Adaptive Analytics
Ai Infrastructure

[2602.18910] SLDP: Semi-Local Differential Privacy for Density-Adaptive Analytics

The paper introduces Semi-Local Differential Privacy (SLDP), a framework that enhances privacy-preserving analytics by decoupling privacy...

arXiv - Machine Learning · 3 min ·
[2602.19519] Ada-RS: Adaptive Rejection Sampling for Selective Thinking
Llms

[2602.19519] Ada-RS: Adaptive Rejection Sampling for Selective Thinking

The paper introduces Ada-RS, an adaptive rejection sampling framework aimed at enhancing selective thinking in large language models (LLM...

arXiv - Machine Learning · 3 min ·
[2602.18851] Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training
Machine Learning

[2602.18851] Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training

This paper presents a novel approach to stabilize low-precision training in transformer models by deriving rank-aware spectral bounds on ...

arXiv - AI · 3 min ·
[2602.19458] ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making
Llms

[2602.19458] ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making

The paper presents ComplLLM, a framework for fine-tuning large language models (LLMs) to enhance decision-making by utilizing complementa...

arXiv - AI · 3 min ·
[2602.18825] Bayesian Lottery Ticket Hypothesis
Machine Learning

[2602.18825] Bayesian Lottery Ticket Hypothesis

The paper explores the Bayesian Lottery Ticket Hypothesis, demonstrating that sparse subnetworks in Bayesian neural networks can achieve ...

arXiv - Machine Learning · 4 min ·
[2602.18795] Vectorized Bayesian Inference for Latent Dirichlet-Tree Allocation
Machine Learning

[2602.18795] Vectorized Bayesian Inference for Latent Dirichlet-Tree Allocation

This paper presents a novel framework, Latent Dirichlet-Tree Allocation (LDTA), which enhances the traditional Latent Dirichlet Allocatio...

arXiv - Machine Learning · 3 min ·
[2602.19390] Artificial Intelligence for Modeling & Simulation in Digital Twins
Machine Learning

[2602.19390] Artificial Intelligence for Modeling & Simulation in Digital Twins

This article explores the integration of artificial intelligence with modeling and simulation in digital twins, highlighting their roles ...

arXiv - AI · 4 min ·
[2602.18733] Prior Aware Memorization: An Efficient Metric for Distinguishing Memorization from Generalization in Large Language Models
Llms

[2602.18733] Prior Aware Memorization: An Efficient Metric for Distinguishing Memorization from Generalization in Large Language Models

The paper introduces Prior Aware Memorization, a new metric for distinguishing genuine memorization from generalization in large language...

arXiv - Machine Learning · 4 min ·
[2602.18658] Communication-Efficient Personalized Adaptation via Federated-Local Model Merging
Llms

[2602.18658] Communication-Efficient Personalized Adaptation via Federated-Local Model Merging

The paper presents Potara, a framework for federated personalization that merges general and personalized models, improving efficiency an...

arXiv - Machine Learning · 3 min ·
[2602.19128] K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model
Llms

[2602.19128] K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model

The paper presents K-Search, a novel framework for optimizing GPU kernels using a co-evolving intrinsic world model, significantly improv...

arXiv - AI · 4 min ·
[2602.18647] Information-Guided Noise Allocation for Efficient Diffusion Training
Machine Learning

[2602.18647] Information-Guided Noise Allocation for Efficient Diffusion Training

The paper presents InfoNoise, a data-adaptive noise scheduling method for diffusion training, enhancing efficiency and performance by uti...

arXiv - AI · 4 min ·
[2602.18645] Adaptive Time Series Reasoning via Segment Selection
Machine Learning

[2602.18645] Adaptive Time Series Reasoning via Segment Selection

The paper presents ARTIST, a novel approach to time series reasoning that utilizes adaptive segment selection to improve accuracy in answ...

arXiv - Machine Learning · 4 min ·
Previous Page 104 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime