AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

AI Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · 10 minutes ago

LLMs

[2603.15159] To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation

Abstract page for arXiv paper 2603.15159: To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation

arXiv - AI · 4 min · about 3 hours ago

LLMs

[2602.07374] TernaryLM: Memory-Efficient Language Modeling via Native 1.5-Bit Quantization with Adaptive Layer-wise Scaling

Abstract page for arXiv paper 2602.07374: TernaryLM: Memory-Efficient Language Modeling via Native 1.5-Bit Quantization with Adaptive Lay...

arXiv - AI · 4 min · about 3 hours ago

All Content

Machine Learning

[2603.20223] Inference Energy and Latency in AI-Mediated Education: A Learning-per-Watt Analysis of Edge and Cloud Models

Abstract page for arXiv paper 2603.20223: Inference Energy and Latency in AI-Mediated Education: A Learning-per-Watt Analysis of Edge and...

arXiv - Machine Learning · 4 min · 6 days ago

LLMs

[2603.22096] GSEM: Graph-based Self-Evolving Memory for Experience Augmented Clinical Reasoning

Abstract page for arXiv paper 2603.22096: GSEM: Graph-based Self-Evolving Memory for Experience Augmented Clinical Reasoning

arXiv - AI · 3 min · 6 days ago

LLMs

[2603.22083] A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP

Abstract page for arXiv paper 2603.22083: A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP

arXiv - AI · 4 min · 6 days ago

NLP

[2603.21925] Guideline-grounded retrieval-augmented generation for ophthalmic clinical decision support

Abstract page for arXiv paper 2603.21925: Guideline-grounded retrieval-augmented generation for ophthalmic clinical decision support

arXiv - AI · 4 min · 6 days ago

LLMs

[2603.21854] Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language Models

Abstract page for arXiv paper 2603.21854: Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language ...

arXiv - AI · 4 min · 6 days ago

Machine Learning

[2603.21696] MIND: Multi-agent inference for negotiation dialogue in travel planning

Abstract page for arXiv paper 2603.21696: MIND: Multi-agent inference for negotiation dialogue in travel planning

arXiv - AI · 3 min · 6 days ago

LLMs

[2603.21690] AI Token Futures Market: Commoditization of Compute and Derivatives Contract Design

Abstract page for arXiv paper 2603.21690: AI Token Futures Market: Commoditization of Compute and Derivatives Contract Design

arXiv - AI · 3 min · 6 days ago

LLMs

[2603.21630] EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises

Abstract page for arXiv paper 2603.21630: EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises

arXiv - AI · 3 min · 6 days ago

Machine Learning

[2603.21558] Stabilizing Iterative Self-Training with Verified Reasoning via Symbolic Recursive Self-Alignment

Abstract page for arXiv paper 2603.21558: Stabilizing Iterative Self-Training with Verified Reasoning via Symbolic Recursive Self-Alignment

arXiv - AI · 4 min · 6 days ago

LLMs

[2603.21415] Silent Commitment Failure in Instruction-Tuned Language Models: Evidence of Governability Divergence Across Architectures

Abstract page for arXiv paper 2603.21415: Silent Commitment Failure in Instruction-Tuned Language Models: Evidence of Governability Diver...

arXiv - Machine Learning · 4 min · 6 days ago

LLMs

[2603.21237] ConsRoute: Consistency-Aware Adaptive Query Routing for Cloud-Edge-Device Large Language Models

Abstract page for arXiv paper 2603.21237: ConsRoute: Consistency-Aware Adaptive Query Routing for Cloud-Edge-Device Large Language Models

arXiv - AI · 4 min · 6 days ago

LLMs

[2603.21162] Revisiting Tree Search for LLMs: Gumbel and Sequential Halving for Budget-Scalable Reasoning

Abstract page for arXiv paper 2603.21162: Revisiting Tree Search for LLMs: Gumbel and Sequential Halving for Budget-Scalable Reasoning

arXiv - Machine Learning · 3 min · 6 days ago

AI Safety

[2603.20925] Profit is the Red Team: Stress-Testing Agents in Strategic Economic Interactions

Abstract page for arXiv paper 2603.20925: Profit is the Red Team: Stress-Testing Agents in Strategic Economic Interactions

arXiv - AI · 4 min · 6 days ago

LLMs

[2603.20911] Do LLM-Driven Agents Exhibit Engagement Mechanisms? Controlled Tests of Information Load, Descriptive Norms, and Popularity Cues

Abstract page for arXiv paper 2603.20911: Do LLM-Driven Agents Exhibit Engagement Mechanisms? Controlled Tests of Information Load, Descr...

arXiv - AI · 3 min · 6 days ago

LLMs

[2603.20650] From 50% to Mastery in 3 Days: A Low-Resource SOP for Localizing Graduate-Level AI Tutors via Shadow-RAG

Abstract page for arXiv paper 2603.20650: From 50% to Mastery in 3 Days: A Low-Resource SOP for Localizing Graduate-Level AI Tutors via S...

arXiv - AI · 4 min · 6 days ago

LLMs

[2603.20633] Seed1.8 Model Card: Towards Generalized Real-World Agency

Abstract page for arXiv paper 2603.20633: Seed1.8 Model Card: Towards Generalized Real-World Agency

arXiv - AI · 3 min · 6 days ago

Machine Learning

[2603.20620] Reasoning Traces Shape Outputs but Models Won't Say So

Abstract page for arXiv paper 2603.20620: Reasoning Traces Shape Outputs but Models Won't Say So

arXiv - AI · 3 min · 6 days ago

LLMs

[2603.20510] Grounded Chess Reasoning in Language Models via Master Distillation

Abstract page for arXiv paper 2603.20510: Grounded Chess Reasoning in Language Models via Master Distillation

arXiv - AI · 4 min · 6 days ago

LLMs

[2603.20435] Deep reflective reasoning in interdependence constrained structured data extraction from clinical notes for digital health

Abstract page for arXiv paper 2603.20435: Deep reflective reasoning in interdependence constrained structured data extraction from clinic...

arXiv - AI · 4 min · 6 days ago

Robotics

[2603.20285] AgentComm-Bench: Stress-Testing Cooperative Embodied AI Under Latency, Packet Loss, and Bandwidth Collapse

Abstract page for arXiv paper 2603.20285: AgentComm-Bench: Stress-Testing Cooperative Embodied AI Under Latency, Packet Loss, and Bandwid...

arXiv - AI · 4 min · 6 days ago
