AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
[2603.15159] To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation
Llms

[2603.15159] To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation

Abstract page for arXiv paper 2603.15159: To See is Not to Master: Teaching LLMs to Use Private Libraries for Code Generation

arXiv - AI · 4 min ·
[2602.07374] TernaryLM: Memory-Efficient Language Modeling via Native 1.5-Bit Quantization with Adaptive Layer-wise Scaling
Llms

[2602.07374] TernaryLM: Memory-Efficient Language Modeling via Native 1.5-Bit Quantization with Adaptive Layer-wise Scaling

Abstract page for arXiv paper 2602.07374: TernaryLM: Memory-Efficient Language Modeling via Native 1.5-Bit Quantization with Adaptive Lay...

arXiv - AI · 4 min ·

All Content

[2603.20223] Inference Energy and Latency in AI-Mediated Education: A Learning-per-Watt Analysis of Edge and Cloud Models
Machine Learning

[2603.20223] Inference Energy and Latency in AI-Mediated Education: A Learning-per-Watt Analysis of Edge and Cloud Models

Abstract page for arXiv paper 2603.20223: Inference Energy and Latency in AI-Mediated Education: A Learning-per-Watt Analysis of Edge and...

arXiv - Machine Learning · 4 min ·
[2603.22096] GSEM: Graph-based Self-Evolving Memory for Experience Augmented Clinical Reasoning
Llms

[2603.22096] GSEM: Graph-based Self-Evolving Memory for Experience Augmented Clinical Reasoning

Abstract page for arXiv paper 2603.22096: GSEM: Graph-based Self-Evolving Memory for Experience Augmented Clinical Reasoning

arXiv - AI · 3 min ·
[2603.22083] A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP
Llms

[2603.22083] A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP

Abstract page for arXiv paper 2603.22083: A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP

arXiv - AI · 4 min ·
[2603.21925] Guideline-grounded retrieval-augmented generation for ophthalmic clinical decision support
Nlp

[2603.21925] Guideline-grounded retrieval-augmented generation for ophthalmic clinical decision support

Abstract page for arXiv paper 2603.21925: Guideline-grounded retrieval-augmented generation for ophthalmic clinical decision support

arXiv - AI · 4 min ·
[2603.21854] Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language Models
Llms

[2603.21854] Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language Models

Abstract page for arXiv paper 2603.21854: Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language ...

arXiv - AI · 4 min ·
[2603.21696] MIND: Multi-agent inference for negotiation dialogue in travel planning
Machine Learning

[2603.21696] MIND: Multi-agent inference for negotiation dialogue in travel planning

Abstract page for arXiv paper 2603.21696: MIND: Multi-agent inference for negotiation dialogue in travel planning

arXiv - AI · 3 min ·
[2603.21690] AI Token Futures Market: Commoditization of Compute and Derivatives Contract Design
Llms

[2603.21690] AI Token Futures Market: Commoditization of Compute and Derivatives Contract Design

Abstract page for arXiv paper 2603.21690: AI Token Futures Market: Commoditization of Compute and Derivatives Contract Design

arXiv - AI · 3 min ·
[2603.21630] EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises
Llms

[2603.21630] EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises

Abstract page for arXiv paper 2603.21630: EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises

arXiv - AI · 3 min ·
[2603.21558] Stabilizing Iterative Self-Training with Verified Reasoning via Symbolic Recursive Self-Alignment
Machine Learning

[2603.21558] Stabilizing Iterative Self-Training with Verified Reasoning via Symbolic Recursive Self-Alignment

Abstract page for arXiv paper 2603.21558: Stabilizing Iterative Self-Training with Verified Reasoning via Symbolic Recursive Self-Alignment

arXiv - AI · 4 min ·
[2603.21415] Silent Commitment Failure in Instruction-Tuned Language Models: Evidence of Governability Divergence Across Architectures
Llms

[2603.21415] Silent Commitment Failure in Instruction-Tuned Language Models: Evidence of Governability Divergence Across Architectures

Abstract page for arXiv paper 2603.21415: Silent Commitment Failure in Instruction-Tuned Language Models: Evidence of Governability Diver...

arXiv - Machine Learning · 4 min ·
[2603.21237] ConsRoute:Consistency-Aware Adaptive Query Routing for Cloud-Edge-Device Large Language Models
Llms

[2603.21237] ConsRoute:Consistency-Aware Adaptive Query Routing for Cloud-Edge-Device Large Language Models

Abstract page for arXiv paper 2603.21237: ConsRoute:Consistency-Aware Adaptive Query Routing for Cloud-Edge-Device Large Language Models

arXiv - AI · 4 min ·
[2603.21162] Revisiting Tree Search for LLMs: Gumbel and Sequential Halving for Budget-Scalable Reasoning
Llms

[2603.21162] Revisiting Tree Search for LLMs: Gumbel and Sequential Halving for Budget-Scalable Reasoning

Abstract page for arXiv paper 2603.21162: Revisiting Tree Search for LLMs: Gumbel and Sequential Halving for Budget-Scalable Reasoning

arXiv - Machine Learning · 3 min ·
[2603.20925] Profit is the Red Team: Stress-Testing Agents in Strategic Economic Interactions
Ai Safety

[2603.20925] Profit is the Red Team: Stress-Testing Agents in Strategic Economic Interactions

Abstract page for arXiv paper 2603.20925: Profit is the Red Team: Stress-Testing Agents in Strategic Economic Interactions

arXiv - AI · 4 min ·
[2603.20911] Do LLM-Driven Agents Exhibit Engagement Mechanisms? Controlled Tests of Information Load, Descriptive Norms, and Popularity Cues
Llms

[2603.20911] Do LLM-Driven Agents Exhibit Engagement Mechanisms? Controlled Tests of Information Load, Descriptive Norms, and Popularity Cues

Abstract page for arXiv paper 2603.20911: Do LLM-Driven Agents Exhibit Engagement Mechanisms? Controlled Tests of Information Load, Descr...

arXiv - AI · 3 min ·
[2603.20650] From 50% to Mastery in 3 Days: A Low-Resource SOP for Localizing Graduate-Level AI Tutors via Shadow-RAG
Llms

[2603.20650] From 50% to Mastery in 3 Days: A Low-Resource SOP for Localizing Graduate-Level AI Tutors via Shadow-RAG

Abstract page for arXiv paper 2603.20650: From 50% to Mastery in 3 Days: A Low-Resource SOP for Localizing Graduate-Level AI Tutors via S...

arXiv - AI · 4 min ·
[2603.20633] Seed1.8 Model Card: Towards Generalized Real-World Agency
Llms

[2603.20633] Seed1.8 Model Card: Towards Generalized Real-World Agency

Abstract page for arXiv paper 2603.20633: Seed1.8 Model Card: Towards Generalized Real-World Agency

arXiv - AI · 3 min ·
[2603.20620] Reasoning Traces Shape Outputs but Models Won't Say So
Machine Learning

[2603.20620] Reasoning Traces Shape Outputs but Models Won't Say So

Abstract page for arXiv paper 2603.20620: Reasoning Traces Shape Outputs but Models Won't Say So

arXiv - AI · 3 min ·
[2603.20510] Grounded Chess Reasoning in Language Models via Master Distillation
Llms

[2603.20510] Grounded Chess Reasoning in Language Models via Master Distillation

Abstract page for arXiv paper 2603.20510: Grounded Chess Reasoning in Language Models via Master Distillation

arXiv - AI · 4 min ·
[2603.20435] Deep reflective reasoning in interdependence constrained structured data extraction from clinical notes for digital health
Llms

[2603.20435] Deep reflective reasoning in interdependence constrained structured data extraction from clinical notes for digital health

Abstract page for arXiv paper 2603.20435: Deep reflective reasoning in interdependence constrained structured data extraction from clinic...

arXiv - AI · 4 min ·
[2603.20285] AgentComm-Bench: Stress-Testing Cooperative Embodied AI Under Latency, Packet Loss, and Bandwidth Collapse
Robotics

[2603.20285] AgentComm-Bench: Stress-Testing Cooperative Embodied AI Under Latency, Packet Loss, and Bandwidth Collapse

Abstract page for arXiv paper 2603.20285: AgentComm-Bench: Stress-Testing Cooperative Embodied AI Under Latency, Packet Loss, and Bandwid...

arXiv - AI · 4 min ·
Previous Page 21 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime