AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Llms

[D] How's MLX and jax/ pytorch on MacBooks these days?

​ So I'm looking at buying a new 14 inch MacBook pro with m5 pro and 64 gb of memory vs m4 max with same specs. My priorities are pro sof...

Reddit - Machine Learning · 1 min ·
Ai Infrastructure

Who needs fancy stuff, When you can program, build, train and run 2 completely different ai agents on an i3 4GB RAM and onboard gpu chip? looool

And I know some of yall doubt - so I’ll follow up. submitted by /u/Snoo-76697 [link] [comments]

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.20031] Latent Introspection: Models Can Detect Prior Concept Injections
Machine Learning

[2602.20031] Latent Introspection: Models Can Detect Prior Concept Injections

This article presents findings on the latent introspection abilities of the Qwen 32B model, showing its capacity to detect prior concept ...

arXiv - Machine Learning · 3 min ·
[2602.18934] LoMime: Query-Efficient Membership Inference using Model Extraction in Label-Only Settings
Machine Learning

[2602.18934] LoMime: Query-Efficient Membership Inference using Model Extraction in Label-Only Settings

The paper presents LoMime, a novel framework for membership inference attacks that operates efficiently under label-only conditions, sign...

arXiv - Machine Learning · 4 min ·
[2602.18910] SLDP: Semi-Local Differential Privacy for Density-Adaptive Analytics
Ai Infrastructure

[2602.18910] SLDP: Semi-Local Differential Privacy for Density-Adaptive Analytics

The paper introduces Semi-Local Differential Privacy (SLDP), a framework that enhances privacy-preserving analytics by decoupling privacy...

arXiv - Machine Learning · 3 min ·
[2602.19519] Ada-RS: Adaptive Rejection Sampling for Selective Thinking
Llms

[2602.19519] Ada-RS: Adaptive Rejection Sampling for Selective Thinking

The paper introduces Ada-RS, an adaptive rejection sampling framework aimed at enhancing selective thinking in large language models (LLM...

arXiv - Machine Learning · 3 min ·
[2602.18851] Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training
Machine Learning

[2602.18851] Rank-Aware Spectral Bounds on Attention Logits for Stable Low-Precision Training

This paper presents a novel approach to stabilize low-precision training in transformer models by deriving rank-aware spectral bounds on ...

arXiv - AI · 3 min ·
[2602.19458] ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making
Llms

[2602.19458] ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making

The paper presents ComplLLM, a framework for fine-tuning large language models (LLMs) to enhance decision-making by utilizing complementa...

arXiv - AI · 3 min ·
[2602.18825] Bayesian Lottery Ticket Hypothesis
Machine Learning

[2602.18825] Bayesian Lottery Ticket Hypothesis

The paper explores the Bayesian Lottery Ticket Hypothesis, demonstrating that sparse subnetworks in Bayesian neural networks can achieve ...

arXiv - Machine Learning · 4 min ·
[2602.18795] Vectorized Bayesian Inference for Latent Dirichlet-Tree Allocation
Machine Learning

[2602.18795] Vectorized Bayesian Inference for Latent Dirichlet-Tree Allocation

This paper presents a novel framework, Latent Dirichlet-Tree Allocation (LDTA), which enhances the traditional Latent Dirichlet Allocatio...

arXiv - Machine Learning · 3 min ·
[2602.19390] Artificial Intelligence for Modeling & Simulation in Digital Twins
Machine Learning

[2602.19390] Artificial Intelligence for Modeling & Simulation in Digital Twins

This article explores the integration of artificial intelligence with modeling and simulation in digital twins, highlighting their roles ...

arXiv - AI · 4 min ·
[2602.18733] Prior Aware Memorization: An Efficient Metric for Distinguishing Memorization from Generalization in Large Language Models
Llms

[2602.18733] Prior Aware Memorization: An Efficient Metric for Distinguishing Memorization from Generalization in Large Language Models

The paper introduces Prior Aware Memorization, a new metric for distinguishing genuine memorization from generalization in large language...

arXiv - Machine Learning · 4 min ·
[2602.18658] Communication-Efficient Personalized Adaptation via Federated-Local Model Merging
Llms

[2602.18658] Communication-Efficient Personalized Adaptation via Federated-Local Model Merging

The paper presents Potara, a framework for federated personalization that merges general and personalized models, improving efficiency an...

arXiv - Machine Learning · 3 min ·
[2602.19128] K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model
Llms

[2602.19128] K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model

The paper presents K-Search, a novel framework for optimizing GPU kernels using a co-evolving intrinsic world model, significantly improv...

arXiv - AI · 4 min ·
[2602.18647] Information-Guided Noise Allocation for Efficient Diffusion Training
Machine Learning

[2602.18647] Information-Guided Noise Allocation for Efficient Diffusion Training

The paper presents InfoNoise, a data-adaptive noise scheduling method for diffusion training, enhancing efficiency and performance by uti...

arXiv - AI · 4 min ·
[2602.18645] Adaptive Time Series Reasoning via Segment Selection
Machine Learning

[2602.18645] Adaptive Time Series Reasoning via Segment Selection

The paper presents ARTIST, a novel approach to time series reasoning that utilizes adaptive segment selection to improve accuracy in answ...

arXiv - Machine Learning · 4 min ·
[2602.18613] Diagnosing LLM Reranker Behavior Under Fixed Evidence Pools
Llms

[2602.18613] Diagnosing LLM Reranker Behavior Under Fixed Evidence Pools

This paper presents a diagnostic method for evaluating LLM reranker behavior using fixed evidence pools, isolating ranking policies from ...

arXiv - Machine Learning · 3 min ·
[2602.18986] Quantifying Automation Risk in High-Automation AI Systems: A Bayesian Framework for Failure Propagation and Optimal Oversight
Ai Safety

[2602.18986] Quantifying Automation Risk in High-Automation AI Systems: A Bayesian Framework for Failure Propagation and Optimal Oversight

This paper presents a Bayesian framework for assessing automation risk in high-automation AI systems, focusing on failure propagation and...

arXiv - AI · 4 min ·
[2602.18985] InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing
Robotics

[2602.18985] InfEngine: A Self-Verifying and Self-Optimizing Intelligent Engine for Infrared Radiation Computing

InfEngine is an innovative autonomous engine designed to enhance infrared radiation computing by automating workflows, achieving a 92.7% ...

arXiv - AI · 3 min ·
[2602.18968] Robust and Efficient Tool Orchestration via Layered Execution Structures with Reflective Correction
Ai Agents

[2602.18968] Robust and Efficient Tool Orchestration via Layered Execution Structures with Reflective Correction

This article presents a novel approach to tool orchestration in agentic systems, emphasizing a layered execution structure that enhances ...

arXiv - AI · 4 min ·
[2602.18960] Modularity is the Bedrock of Natural and Artificial Intelligence
Ai Agents

[2602.18960] Modularity is the Bedrock of Natural and Artificial Intelligence

The paper discusses the importance of modularity in both natural and artificial intelligence, highlighting its role in efficient learning...

arXiv - AI · 4 min ·
[2602.18956] INDUCTION: Finite-Structure Concept Synthesis in First-Order Logic
Machine Learning

[2602.18956] INDUCTION: Finite-Structure Concept Synthesis in First-Order Logic

The paper introduces INDUCTION, a benchmark for finite structure concept synthesis in first-order logic, focusing on generating logical f...

arXiv - AI · 3 min ·
Previous Page 98 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime