AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 1 hour ago

Llms

[D] How's MLX and jax/ pytorch on MacBooks these days?

So I'm looking at buying a new 14 inch MacBook pro with m5 pro and 64 gb of memory vs m4 max with same specs. My priorities are pro sof...

Reddit - Machine Learning · 1 min · about 2 hours ago

Ai Infrastructure

Who needs fancy stuff, When you can program, build, train and run 2 completely different ai agents on an i3 4GB RAM and onboard gpu chip? looool

And I know some of yall doubt - so I’ll follow up. submitted by /u/Snoo-76697 [link] [comments]

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

All Content

Machine Learning

[2602.20031] Latent Introspection: Models Can Detect Prior Concept Injections

This article presents findings on the latent introspection abilities of the Qwen 32B model, showing its capacity to detect prior concept ...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.18934] LoMime: Query-Efficient Membership Inference using Model Extraction in Label-Only Settings

The paper presents LoMime, a novel framework for membership inference attacks that operates efficiently under label-only conditions, sign...

arXiv - Machine Learning · 4 min · about 1 month ago

Ai Infrastructure

[2602.18910] SLDP: Semi-Local Differential Privacy for Density-Adaptive Analytics

The paper introduces Semi-Local Differential Privacy (SLDP), a framework that enhances privacy-preserving analytics by decoupling privacy...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.19519] Ada-RS: Adaptive Rejection Sampling for Selective Thinking

The paper introduces Ada-RS, an adaptive rejection sampling framework aimed at enhancing selective thinking in large language models (LLM...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.18851] Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training

This paper presents a novel approach to stabilize low-precision training in transformer models by deriving rank-aware spectral bounds on ...

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.19458] ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making

The paper presents ComplLLM, a framework for fine-tuning large language models (LLMs) to enhance decision-making by utilizing complementa...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.18825] Bayesian Lottery Ticket Hypothesis

The paper explores the Bayesian Lottery Ticket Hypothesis, demonstrating that sparse subnetworks in Bayesian neural networks can achieve ...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.18795] Vectorized Bayesian Inference for Latent Dirichlet-Tree Allocation

This paper presents a novel framework, Latent Dirichlet-Tree Allocation (LDTA), which enhances the traditional Latent Dirichlet Allocatio...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.19390] Artificial Intelligence for Modeling & Simulation in Digital Twins

This article explores the integration of artificial intelligence with modeling and simulation in digital twins, highlighting their roles ...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.18733] Prior Aware Memorization: An Efficient Metric for Distinguishing Memorization from Generalization in Large Language Models

The paper introduces Prior Aware Memorization, a new metric for distinguishing genuine memorization from generalization in large language...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.18658] Communication-Efficient Personalized Adaptation via Federated-Local Model Merging

The paper presents Potara, a framework for federated personalization that merges general and personalized models, improving efficiency an...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2602.19128] K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model

The paper presents K-Search, a novel framework for optimizing GPU kernels using a co-evolving intrinsic world model, significantly improv...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.18647] Information-Guided Noise Allocation for Efficient Diffusion Training

The paper presents InfoNoise, a data-adaptive noise scheduling method for diffusion training, enhancing efficiency and performance by uti...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.18645] Adaptive Time Series Reasoning via Segment Selection

The paper presents ARTIST, a novel approach to time series reasoning that utilizes adaptive segment selection to improve accuracy in answ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.18613] Diagnosing LLM Reranker Behavior Under Fixed Evidence Pools

This paper presents a diagnostic method for evaluating LLM reranker behavior using fixed evidence pools, isolating ranking policies from ...

arXiv - Machine Learning · 3 min · about 1 month ago

Ai Safety

[2602.18986] Quantifying Automation Risk in High-Automation AI Systems: A Bayesian Framework for Failure Propagation and Optimal Oversight

This paper presents a Bayesian framework for assessing automation risk in high-automation AI systems, focusing on failure propagation and...

arXiv - AI · 4 min · about 1 month ago

Robotics

[2602.18985] InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing

InfEngine is an innovative autonomous engine designed to enhance infrared radiation computing by automating workflows, achieving a 92.7% ...

arXiv - AI · 3 min · about 1 month ago

Ai Agents

[2602.18968] Robust and Efficient Tool Orchestration via Layered Execution Structures with Reflective Correction

This article presents a novel approach to tool orchestration in agentic systems, emphasizing a layered execution structure that enhances ...

arXiv - AI · 4 min · about 1 month ago

Ai Agents

[2602.18960] Modularity is the Bedrock of Natural and Artificial Intelligence

The paper discusses the importance of modularity in both natural and artificial intelligence, highlighting its role in efficient learning...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.18956] INDUCTION: Finite-Structure Concept Synthesis in First-Order Logic

The paper introduces INDUCTION, a benchmark for finite structure concept synthesis in first-order logic, focusing on generating logical f...

arXiv - AI · 3 min · about 1 month ago

Previous Page 98 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence

[D] How's MLX and jax/ pytorch on MacBooks these days?

Who needs fancy stuff, When you can program, build, train and run 2 completely different ai agents on an i3 4GB RAM and onboard gpu chip? looool

All Content

[2602.20031] Latent Introspection: Models Can Detect Prior Concept Injections

[2602.18934] LoMime: Query-Efficient Membership Inference using Model Extraction in Label-Only Settings

[2602.18910] SLDP: Semi-Local Differential Privacy for Density-Adaptive Analytics

[2602.19519] Ada-RS: Adaptive Rejection Sampling for Selective Thinking

[2602.18851] Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training

[2602.19458] ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making

[2602.18825] Bayesian Lottery Ticket Hypothesis

[2602.18795] Vectorized Bayesian Inference for Latent Dirichlet-Tree Allocation

[2602.19390] Artificial Intelligence for Modeling & Simulation in Digital Twins

[2602.18733] Prior Aware Memorization: An Efficient Metric for Distinguishing Memorization from Generalization in Large Language Models

[2602.18658] Communication-Efficient Personalized Adaptation via Federated-Local Model Merging

[2602.19128] K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model

[2602.18647] Information-Guided Noise Allocation for Efficient Diffusion Training

[2602.18645] Adaptive Time Series Reasoning via Segment Selection

[2602.18613] Diagnosing LLM Reranker Behavior Under Fixed Evidence Pools

[2602.18986] Quantifying Automation Risk in High-Automation AI Systems: A Bayesian Framework for Failure Propagation and Optimal Oversight

[2602.18985] InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing

[2602.18968] Robust and Efficient Tool Orchestration via Layered Execution Structures with Reflective Correction

[2602.18960] Modularity is the Bedrock of Natural and Artificial Intelligence

[2602.18956] INDUCTION: Finite-Structure Concept Synthesis in First-Order Logic

Related Topics

Stay updated with AI News