AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

LLMs

World models will be the next big thing, bye-bye LLMs

I was at Nvidia's GTC conference recently, and honestly it was one of the most eye-opening events I've attended in a while. There was a lot...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

What tools are sr MLEs using? (clawdbot, openspec, wispr) [D]

I'm already blasting Cursor, but I want to level up my output. I heard that these kinds of AI tools and workflows are being asked about in SF. W...

Reddit - Machine Learning · 1 min ·
LLMs

[R] An attack class that passes every current LLM filter - no payload, no injection signature, no log trace

https://shapingrooms.com/research I've been documenting what I'm calling postural manipulation: a specific class of language that install...

Reddit - Machine Learning · 1 min ·

All Content

[2603.05234] Recursive Inference Machines for Neural Reasoning
Machine Learning

Abstract page for arXiv paper 2603.05234: Recursive Inference Machines for Neural Reasoning

arXiv - Machine Learning · 3 min ·
[2603.05231] Boosting ASR Robustness via Test-Time Reinforcement Learning with Audio-Text Semantic Rewards
Machine Learning

Abstract page for arXiv paper 2603.05231: Boosting ASR Robustness via Test-Time Reinforcement Learning with Audio-Text Semantic Rewards

arXiv - Machine Learning · 4 min ·
[2603.05188] Escaping the Hydrolysis Trap: An Agentic Workflow for Inverse Design of Durable Photocatalytic Covalent Organic Frameworks
AI Infrastructure

Abstract page for arXiv paper 2603.05188: Escaping the Hydrolysis Trap: An Agentic Workflow for Inverse Design of Durable Photocatalytic ...

arXiv - AI · 4 min ·
[2603.05167] C2-Faith: Benchmarking LLM Judges for Causal and Coverage Faithfulness in Chain-of-Thought Reasoning
LLMs

Abstract page for arXiv paper 2603.05167: C2-Faith: Benchmarking LLM Judges for Causal and Coverage Faithfulness in Chain-of-Thought Reas...

arXiv - AI · 3 min ·
[2603.05160] Lifelong Language-Conditioned Robotic Manipulation Learning
Robotics

Abstract page for arXiv paper 2603.05160: Lifelong Language-Conditioned Robotic Manipulation Learning

arXiv - AI · 3 min ·
[2603.05111] SPIRIT: Perceptive Shared Autonomy for Robust Robotic Manipulation under Deep Learning Uncertainty
Machine Learning

Abstract page for arXiv paper 2603.05111: SPIRIT: Perceptive Shared Autonomy for Robust Robotic Manipulation under Deep Learning Uncertainty

arXiv - AI · 4 min ·
[2603.05099] ARC-TGI: Human-Validated Task Generators with Reasoning Chain Templates for ARC-AGI
AI Infrastructure

Abstract page for arXiv paper 2603.05099: ARC-TGI: Human-Validated Task Generators with Reasoning Chain Templates for ARC-AGI

arXiv - Machine Learning · 4 min ·
[2603.04902] AgentSCOPE: Evaluating Contextual Privacy Across Agentic Workflows
AI Infrastructure

Abstract page for arXiv paper 2603.04902: AgentSCOPE: Evaluating Contextual Privacy Across Agentic Workflows

arXiv - AI · 4 min ·
[2603.04893] Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models
LLMs

Abstract page for arXiv paper 2603.04893: Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models

arXiv - AI · 4 min ·
[2603.04890] FedAFD: Multimodal Federated Learning via Adversarial Fusion and Distillation
Machine Learning

Abstract page for arXiv paper 2603.04890: FedAFD: Multimodal Federated Learning via Adversarial Fusion and Distillation

arXiv - Machine Learning · 4 min ·
[2603.04464] Understanding the Dynamics of Demonstration Conflict in In-Context Learning
LLMs

Abstract page for arXiv paper 2603.04464: Understanding the Dynamics of Demonstration Conflict in In-Context Learning

arXiv - Machine Learning · 4 min ·
[2603.04460] VSPrefill: Vertical-Slash Sparse Attention with Lightweight Indexing for Long-Context Prefilling
LLMs

Abstract page for arXiv paper 2603.04460: VSPrefill: Vertical-Slash Sparse Attention with Lightweight Indexing for Long-Context Prefilling

arXiv - Machine Learning · 3 min ·
[2603.04453] Induced Numerical Instability: Hidden Costs in Multimodal Large Language Models
LLMs

Abstract page for arXiv paper 2603.04453: Induced Numerical Instability: Hidden Costs in Multimodal Large Language Models

arXiv - Machine Learning · 3 min ·
[2603.04444] vLLM Semantic Router: Signal Driven Decision Routing for Mixture-of-Modality Models
LLMs

Abstract page for arXiv paper 2603.04444: vLLM Semantic Router: Signal Driven Decision Routing for Mixture-of-Modality Models

arXiv - AI · 4 min ·
[2603.04429] What Is Missing: Interpretable Ratings for Large Language Model Outputs
LLMs

Abstract page for arXiv paper 2603.04429: What Is Missing: Interpretable Ratings for Large Language Model Outputs

arXiv - AI · 4 min ·
[2603.04428] Agent Memory Below the Prompt: Persistent Q4 KV Cache for Multi-Agent LLM Inference on Edge Devices
LLMs

Abstract page for arXiv paper 2603.04428: Agent Memory Below the Prompt: Persistent Q4 KV Cache for Multi-Agent LLM Inference on Edge Dev...

arXiv - Machine Learning · 4 min ·
[2603.04411] One Size Does Not Fit All: Token-Wise Adaptive Compression for KV Cache
LLMs

Abstract page for arXiv paper 2603.04411: One Size Does Not Fit All: Token-Wise Adaptive Compression for KV Cache

arXiv - Machine Learning · 3 min ·
[2603.05414] Dissociating Direct Access from Inference in AI Introspection
Machine Learning

Abstract page for arXiv paper 2603.05414: Dissociating Direct Access from Inference in AI Introspection

arXiv - AI · 3 min ·
[2603.05069] Jagarin: A Three-Layer Architecture for Hibernating Personal Duty Agents on Mobile
AI Infrastructure

Abstract page for arXiv paper 2603.05069: Jagarin: A Three-Layer Architecture for Hibernating Personal Duty Agents on Mobile

arXiv - AI · 4 min ·
[2603.05036] The Trilingual Triad Framework: Integrating Design, AI, and Domain Knowledge in No-code AI Smart City Course
Machine Learning

Abstract page for arXiv paper 2603.05036: The Trilingual Triad Framework: Integrating Design, AI, and Domain Knowledge in No-code AI Smar...

arXiv - AI · 4 min ·