AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

AI Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 2 hours ago

AI Infrastructure

Most people are using AI wrong—and it’s capping what they can do

1 is a fluke. 2 is a coincidence. 3 is a pattern. Lately I’ve been noticing something. The problems I’m solving are getting more complex…...

Reddit - Artificial Intelligence · 1 min · about 8 hours ago

All Content

LLMs

[2602.20217] KnapSpec: Self-Speculative Decoding via Adaptive Layer Selection as a Knapsack Problem

The paper introduces KnapSpec, a framework for self-speculative decoding that optimizes layer selection in LLMs as a knapsack problem, en...

arXiv - Machine Learning · 4 min · about 1 month ago

AI Safety

[2602.20214] Right to History: A Sovereignty Kernel for Verifiable AI Agent Execution

This paper proposes the 'Right to History,' a principle ensuring individuals have a verifiable record of AI agent actions on personal har...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.20208] Model Merging in the Essential Subspace

This paper presents ESM, a novel framework for merging multiple task-specific models into a single multi-task model, addressing inter-tas...

arXiv - Machine Learning · 3 min · about 1 month ago

LLMs

[2602.20207] Golden Layers and Where to Find Them: Improved Knowledge Editing for Large Language Models Via Layer Gradient Analysis

This article discusses the concept of 'golden layers' in large language models (LLMs) and presents a novel method, Layer Gradient Analysi...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.20204] Analyzing Latency Hiding and Parallelism in an MLIR-based AI Kernel Compiler

This paper analyzes the effectiveness of latency hiding and parallelism techniques in an MLIR-based AI kernel compiler, focusing on vecto...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.20200] Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulation

The paper presents OptimusVLA, a dual-memory framework for robotic manipulation that enhances efficiency and robustness in action generat...

arXiv - AI · 4 min · about 1 month ago

AI Safety

[2602.20196] OpenPort Protocol: A Security Governance Specification for AI Agent Tool Access

The OpenPort Protocol introduces a governance-first approach for AI agents, ensuring secure access to application tools while addressing ...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.20191] MoBiQuant: Mixture-of-Bits Quantization for Token-Adaptive Elastic LLMs

The paper presents MoBiQuant, a novel quantization framework for elastic large language models (LLMs) that adapts weight precision based ...

arXiv - Machine Learning · 4 min · about 1 month ago

Robotics

[2602.20169] Autonomous AI and Ownership Rules

This article explores the ownership rules surrounding AI-generated outputs, examining how they are linked to their creators and the impli...

arXiv - AI · 3 min · about 1 month ago

AI Infrastructure

[2601.12815] Multimodal Multi-Agent Empowered Legal Judgment Prediction

This paper presents JurisMMA, a novel framework for Legal Judgment Prediction (LJP) that utilizes multimodal data to enhance the accuracy...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.21061] Tool Building as a Path to "Superintelligence"

The paper explores how Large Language Models (LLMs) can achieve superintelligence through the Diligent Learner framework, emphasizing the...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.20934] Architecting AgentOS: From Token-Level Context to Emergent System-Level Intelligence

The paper introduces AgentOS, a conceptual framework that transitions Large Language Models from static inference engines to dynamic cogn...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.20732] CHESS: Context-aware Hierarchical Efficient Semantic Selection for Long-Context LLM Inference

The paper presents CHESS, a novel KV-cache management system designed for long-context LLM inference, enhancing efficiency and throughput...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.20728] Balancing Multiple Objectives in Urban Traffic Control with Reinforcement Learning from AI Feedback

This paper explores the use of reinforcement learning from AI feedback (RLAIF) to balance multiple objectives in urban traffic control, a...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.20710] Counterfactual Simulation Training for Chain-of-Thought Faithfulness

The paper introduces Counterfactual Simulation Training (CST), a method designed to enhance Chain-of-Thought (CoT) faithfulness in large ...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.20708] ICON: Indirect Prompt Injection Defense for Agents based on Inference-Time Correction

The paper introduces ICON, a novel framework designed to defend Large Language Model (LLM) agents against Indirect Prompt Injection (IPI)...

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.20659] Recursive Belief Vision Language Model

The Recursive Belief Vision Language Model (RB-VLA) addresses limitations in current vision-language-action models by introducing a belie...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.20628] When can we trust untrusted monitoring? A safety case sketch across collusion strategies

This paper explores the challenges of ensuring safety in AI systems using untrusted monitoring. It develops a taxonomy of collusion strat...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2602.20571] CausalReasoningBenchmark: A Real-World Benchmark for Disentangled Evaluation of Causal Identification and Estimation

The CausalReasoningBenchmark introduces a new framework for evaluating automated causal inference, distinguishing between identification ...

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.20502] ActionEngine: From Reactive to Programmatic GUI Agents via State Machine Memory

The paper presents ActionEngine, a novel framework that enhances GUI agents by transitioning from reactive execution to programmatic plan...

arXiv - Machine Learning · 4 min · about 1 month ago
