AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

CUDA Proves Nvidia Is a Software Company | WIRED
Ai Infrastructure

CUDA Proves Nvidia Is a Software Company | WIRED

There’s a deep, forbidding moat that surrounds Nvidia—and it has nothing to do with hardware.

Wired - AI · 9 min ·
[2511.02805] MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning
Llms

[2511.02805] MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning

Abstract page for arXiv paper 2511.02805: MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Lea...

arXiv - AI · 3 min ·
[2510.22944] Is Your Prompt Poisoning Code? Defect Induction Rates and Security Mitigation Strategies
Llms

[2510.22944] Is Your Prompt Poisoning Code? Defect Induction Rates and Security Mitigation Strategies

Abstract page for arXiv paper 2510.22944: Is Your Prompt Poisoning Code? Defect Induction Rates and Security Mitigation Strategies

arXiv - AI · 4 min ·

All Content

[2605.06927] XiYOLO: Energy-Aware Object Detection via Iterative Architecture Search and Scaling
Computer Vision

[2605.06927] XiYOLO: Energy-Aware Object Detection via Iterative Architecture Search and Scaling

Abstract page for arXiv paper 2605.06927: XiYOLO: Energy-Aware Object Detection via Iterative Architecture Search and Scaling

arXiv - AI · 3 min ·
[2605.06914] Regulating Branch Parallelism in LLM Serving
Llms

[2605.06914] Regulating Branch Parallelism in LLM Serving

Abstract page for arXiv paper 2605.06914: Regulating Branch Parallelism in LLM Serving

arXiv - AI · 3 min ·
[2605.06875] EULER-ADAS: Energy-Efficient & SIMD-Unified Logarithmic-Posit Engine for Precision-Reconfigurable Approximate ADAS Acceleration
Machine Learning

[2605.06875] EULER-ADAS: Energy-Efficient & SIMD-Unified Logarithmic-Posit Engine for Precision-Reconfigurable Approximate ADAS Acceleration

Abstract page for arXiv paper 2605.06875: EULER-ADAS: Energy-Efficient & SIMD-Unified Logarithmic-Posit Engine for Precision-Reconfigurab...

arXiv - AI · 4 min ·
[2605.06820] Overcoming data scarcity through multi-center federated learning for organs-at-risk segmentation in pediatric upper abdominal radiotherapy
Machine Learning

[2605.06820] Overcoming data scarcity through multi-center federated learning for organs-at-risk segmentation in pediatric upper abdominal radiotherapy

Abstract page for arXiv paper 2605.06820: Overcoming data scarcity through multi-center federated learning for organs-at-risk segmentatio...

arXiv - AI · 4 min ·
[2605.06738] From Specification to Deployment: Empirical Evidence from a W3C VC + DID Trust Infrastructure for Autonomous Agents
Robotics

[2605.06738] From Specification to Deployment: Empirical Evidence from a W3C VC + DID Trust Infrastructure for Autonomous Agents

Abstract page for arXiv paper 2605.06738: From Specification to Deployment: Empirical Evidence from a W3C VC + DID Trust Infrastructure f...

arXiv - AI · 4 min ·
[2605.06707] The Single-File Test: A Longitudinal Public-Interface Evaluation of First-Output LLM Web Generation with Social Reach Tracking
Llms

[2605.06707] The Single-File Test: A Longitudinal Public-Interface Evaluation of First-Output LLM Web Generation with Social Reach Tracking

Abstract page for arXiv paper 2605.06707: The Single-File Test: A Longitudinal Public-Interface Evaluation of First-Output LLM Web Genera...

arXiv - AI · 4 min ·
[2504.11101] Consensus Entropy: Harnessing Multi-VLM Agreement for Self-Verifying and Self-Improving OCR
Llms

[2504.11101] Consensus Entropy: Harnessing Multi-VLM Agreement for Self-Verifying and Self-Improving OCR

Abstract page for arXiv paper 2504.11101: Consensus Entropy: Harnessing Multi-VLM Agreement for Self-Verifying and Self-Improving OCR

arXiv - AI · 4 min ·
[2605.08070] VecCISC: Improving Confidence-Informed Self-Consistency with Reasoning Trace Clustering and Candidate Answer Selection
Llms

[2605.08070] VecCISC: Improving Confidence-Informed Self-Consistency with Reasoning Trace Clustering and Candidate Answer Selection

Abstract page for arXiv paper 2605.08070: VecCISC: Improving Confidence-Informed Self-Consistency with Reasoning Trace Clustering and Can...

arXiv - AI · 4 min ·
[2605.08024] MPD$^2$-Router: Mask-aware Multi-expert Prior-regularized Dual-head Deferral Router in Glaucoma Screening and Diagnosis
Ai Infrastructure

[2605.08024] MPD$^2$-Router: Mask-aware Multi-expert Prior-regularized Dual-head Deferral Router in Glaucoma Screening and Diagnosis

Abstract page for arXiv paper 2605.08024: MPD$^2$-Router: Mask-aware Multi-expert Prior-regularized Dual-head Deferral Router in Glaucoma...

arXiv - AI · 3 min ·
[2605.07639] Tacit Knowledge Extraction via Logic Augmented Generation and Active Inference
Machine Learning

[2605.07639] Tacit Knowledge Extraction via Logic Augmented Generation and Active Inference

Abstract page for arXiv paper 2605.07639: Tacit Knowledge Extraction via Logic Augmented Generation and Active Inference

arXiv - AI · 3 min ·
[2605.07631] Inference Time Causal Probing in LLMs
Llms

[2605.07631] Inference Time Causal Probing in LLMs

Abstract page for arXiv paper 2605.07631: Inference Time Causal Probing in LLMs

arXiv - AI · 3 min ·
[2605.07357] GraphReAct: Reasoning and Acting for Multi-step Graph Inference
Llms

[2605.07357] GraphReAct: Reasoning and Acting for Multi-step Graph Inference

Abstract page for arXiv paper 2605.07357: GraphReAct: Reasoning and Acting for Multi-step Graph Inference

arXiv - AI · 4 min ·
[2605.07313] When Stored Evidence Stops Being Usable: Scale-Conditioned Evaluation of Agent Memory
Nlp

[2605.07313] When Stored Evidence Stops Being Usable: Scale-Conditioned Evaluation of Agent Memory

Abstract page for arXiv paper 2605.07313: When Stored Evidence Stops Being Usable: Scale-Conditioned Evaluation of Agent Memory

arXiv - AI · 3 min ·
[2605.07242] MEMOREPAIR: Barrier-First Cascade Repair in Agentic Memory
Nlp

[2605.07242] MEMOREPAIR: Barrier-First Cascade Repair in Agentic Memory

Abstract page for arXiv paper 2605.07242: MEMOREPAIR: Barrier-First Cascade Repair in Agentic Memory

arXiv - AI · 4 min ·
[2605.07112] Switchcraft: AI Model Router for Agentic Tool Calling
Machine Learning

[2605.07112] Switchcraft: AI Model Router for Agentic Tool Calling

Abstract page for arXiv paper 2605.07112: Switchcraft: AI Model Router for Agentic Tool Calling

arXiv - AI · 3 min ·
[2605.06993] Optimal Experiments for Partial Causal Effect Identification
Ai Infrastructure

[2605.06993] Optimal Experiments for Partial Causal Effect Identification

Abstract page for arXiv paper 2605.06993: Optimal Experiments for Partial Causal Effect Identification

arXiv - AI · 4 min ·
[2605.06895] Mitigating Cognitive Bias in RLHF by Altering Rationality
Machine Learning

[2605.06895] Mitigating Cognitive Bias in RLHF by Altering Rationality

Abstract page for arXiv paper 2605.06895: Mitigating Cognitive Bias in RLHF by Altering Rationality

arXiv - AI · 3 min ·
[2605.06890] Beyond the Black Box: Interpretability of Agentic AI Tool Use
Ai Infrastructure

[2605.06890] Beyond the Black Box: Interpretability of Agentic AI Tool Use

Abstract page for arXiv paper 2605.06890: Beyond the Black Box: Interpretability of Agentic AI Tool Use

arXiv - AI · 4 min ·
[2605.06825] Randomness is sometimes necessary for coordination
Ai Infrastructure

[2605.06825] Randomness is sometimes necessary for coordination

Abstract page for arXiv paper 2605.06825: Randomness is sometimes necessary for coordination

arXiv - AI · 3 min ·
[2602.04556] Rethinking Weight Tying: Pseudo-Inverse Tying for LM Stable Training and Updates
Llms

[2602.04556] Rethinking Weight Tying: Pseudo-Inverse Tying for LM Stable Training and Updates

Abstract page for arXiv paper 2602.04556: Rethinking Weight Tying: Pseudo-Inverse Tying for LM Stable Training and Updates

arXiv - Machine Learning · 4 min ·
Previous Page 2 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime