AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
AI Infrastructure

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min
AI Infrastructure

Most people are using AI wrong—and it’s capping what they can do

1 is a fluke. 2 is a coincidence. 3 is a pattern. Lately I’ve been noticing something. The problems I’m solving are getting more complex…...

Reddit - Artificial Intelligence · 1 min

All Content

[2602.20217] KnapSpec: Self-Speculative Decoding via Adaptive Layer Selection as a Knapsack Problem
LLMs

The paper introduces KnapSpec, a framework for self-speculative decoding that optimizes layer selection in LLMs as a knapsack problem, en...

arXiv - Machine Learning · 4 min
[2602.20214] Right to History: A Sovereignty Kernel for Verifiable AI Agent Execution
AI Safety

This paper proposes the 'Right to History,' a principle ensuring individuals have a verifiable record of AI agent actions on personal har...

arXiv - AI · 3 min
[2602.20208] Model Merging in the Essential Subspace
Machine Learning

This paper presents ESM, a novel framework for merging multiple task-specific models into a single multi-task model, addressing inter-tas...

arXiv - Machine Learning · 3 min
[2602.20207] Golden Layers and Where to Find Them: Improved Knowledge Editing for Large Language Models Via Layer Gradient Analysis
LLMs

This article discusses the concept of 'golden layers' in large language models (LLMs) and presents a novel method, Layer Gradient Analysi...

arXiv - AI · 4 min
[2602.20204] Analyzing Latency Hiding and Parallelism in an MLIR-based AI Kernel Compiler
Machine Learning

This paper analyzes the effectiveness of latency hiding and parallelism techniques in an MLIR-based AI kernel compiler, focusing on vecto...

arXiv - AI · 3 min
[2602.20200] Global Prior Meets Local Consistency: Dual-Memory Augmented Vision-Language-Action Model for Efficient Robotic Manipulation
Machine Learning

The paper presents OptimusVLA, a dual-memory framework for robotic manipulation that enhances efficiency and robustness in action generat...

arXiv - AI · 4 min
[2602.20196] OpenPort Protocol: A Security Governance Specification for AI Agent Tool Access
AI Safety

The OpenPort Protocol introduces a governance-first approach for AI agents, ensuring secure access to application tools while addressing ...

arXiv - AI · 4 min
[2602.20191] MoBiQuant: Mixture-of-Bits Quantization for Token-Adaptive Elastic LLMs
LLMs

The paper presents MoBiQuant, a novel quantization framework for elastic large language models (LLMs) that adapts weight precision based ...

arXiv - Machine Learning · 4 min
[2602.20169] Autonomous AI and Ownership Rules
Robotics

This article explores the ownership rules surrounding AI-generated outputs, examining how they are linked to their creators and the impli...

arXiv - AI · 3 min
[2601.12815] Multimodal Multi-Agent Empowered Legal Judgment Prediction
AI Infrastructure

This paper presents JurisMMA, a novel framework for Legal Judgment Prediction (LJP) that utilizes multimodal data to enhance the accuracy...

arXiv - AI · 4 min
[2602.21061] Tool Building as a Path to "Superintelligence"
LLMs

The paper explores how Large Language Models (LLMs) can achieve superintelligence through the Diligent Learner framework, emphasizing the...

arXiv - AI · 3 min
[2602.20934] Architecting AgentOS: From Token-Level Context to Emergent System-Level Intelligence
LLMs

The paper introduces AgentOS, a conceptual framework that transitions Large Language Models from static inference engines to dynamic cogn...

arXiv - AI · 3 min
[2602.20732] CHESS: Context-aware Hierarchical Efficient Semantic Selection for Long-Context LLM Inference
LLMs

The paper presents CHESS, a novel KV-cache management system designed for long-context LLM inference, enhancing efficiency and throughput...

arXiv - AI · 3 min
[2602.20728] Balancing Multiple Objectives in Urban Traffic Control with Reinforcement Learning from AI Feedback
LLMs

This paper explores the use of reinforcement learning from AI feedback (RLAIF) to balance multiple objectives in urban traffic control, a...

arXiv - AI · 3 min
[2602.20710] Counterfactual Simulation Training for Chain-of-Thought Faithfulness
LLMs

The paper introduces Counterfactual Simulation Training (CST), a method designed to enhance Chain-of-Thought (CoT) faithfulness in large ...

arXiv - AI · 4 min
[2602.20708] ICON: Indirect Prompt Injection Defense for Agents based on Inference-Time Correction
LLMs

The paper introduces ICON, a novel framework designed to defend Large Language Model (LLM) agents against Indirect Prompt Injection (IPI)...

arXiv - AI · 3 min
[2602.20659] Recursive Belief Vision Language Model
LLMs

The Recursive Belief Vision Language Model (RB-VLA) addresses limitations in current vision-language-action models by introducing a belie...

arXiv - AI · 4 min
[2602.20628] When can we trust untrusted monitoring? A safety case sketch across collusion strategies
Machine Learning

This paper explores the challenges of ensuring safety in AI systems using untrusted monitoring. It develops a taxonomy of collusion strat...

arXiv - AI · 4 min
[2602.20571] CausalReasoningBenchmark: A Real-World Benchmark for Disentangled Evaluation of Causal Identification and Estimation
Machine Learning

The CausalReasoningBenchmark introduces a new framework for evaluating automated causal inference, distinguishing between identification ...

arXiv - AI · 4 min
[2602.20502] ActionEngine: From Reactive to Programmatic GUI Agents via State Machine Memory
LLMs

The paper presents ActionEngine, a novel framework that enhances GUI agents by transitioning from reactive execution to programmatic plan...

arXiv - Machine Learning · 4 min