AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
AI Infrastructure

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
AI Infrastructure

Mythos SI (Structured Intelligence): Technical Evidence, Coordinated Criticism, and What the Pattern Actually Shows

Perplexity just ran a structural analysis on the criticism campaign against my work. What it found: synchronized language across posts, n...

Reddit - Artificial Intelligence · 1 min ·
LLMs

LLM Guard scored 0/8 detecting a Crescendo multi-turn attack. Arc Sentry flagged it at Turn 3.

Crescendo (Russinovich et al., USENIX Security 2025) is a multi-turn jailbreak that starts with innocent questions and gradually steers a...

Reddit - Artificial Intelligence · 1 min ·

All Content

[2602.02660] MARS: Modular Agent with Reflective Search for Automated AI Research
LLMs

The paper introduces MARS, a Modular Agent designed for automated AI research, emphasizing cost-aware planning and reflective memory to e...

arXiv - AI · 3 min ·
[2509.21199] A Fano-Style Accuracy Upper Bound for LLM Single-Pass Reasoning in Multi-Hop QA
LLMs

This paper presents a theoretical framework establishing a Fano-style accuracy upper bound for single-pass reasoning in multi-hop questio...

arXiv - AI · 4 min ·
[2509.07997] Learning-Based Planning for Improving Science Return of Earth Observation Satellites
AI Infrastructure

The paper presents learning-based approaches to dynamic targeting for Earth observation satellites, demonstrating improved scientific dat...

arXiv - AI · 4 min ·
[2508.00576] MultiSHAP: A Shapley-Based Framework for Explaining Cross-Modal Interactions in Multimodal AI Models
Machine Learning

MultiSHAP introduces a Shapley-based framework for explaining interactions in multimodal AI models, enhancing interpretability and trustw...

arXiv - AI · 4 min ·
[2602.11325] Amortised and provably-robust simulation-based inference
Machine Learning

This paper presents a novel method for simulation-based inference that is robust to outliers and simplifies computation by eliminating th...

arXiv - Machine Learning · 3 min ·
[2507.06134] OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety
AI Infrastructure

OpenAgentSafety introduces a modular framework for evaluating AI agent safety in real-world tasks, addressing critical vulnerabilities in...

arXiv - AI · 4 min ·
[2602.01872] Grappa: Gradient-Only Communication for Scalable Graph Neural Network Training
Machine Learning

Grappa introduces a gradient-only communication framework for scalable training of Graph Neural Networks (GNNs), improving speed and accu...

arXiv - Machine Learning · 4 min ·
[2602.01664] FlowSteer: Interactive Agentic Workflow Orchestration via End-to-End Reinforcement Learning
LLMs

FlowSteer introduces an end-to-end reinforcement learning framework for automating workflow orchestration, addressing challenges like man...

arXiv - Machine Learning · 4 min ·
[2602.15811] Task-Agnostic Continual Learning for Chest Radiograph Classification
Machine Learning

This article presents CARL-XRay, a novel continual learning framework for chest radiograph classification that adapts to new datasets wit...

arXiv - AI · 4 min ·
[2510.02348] mini-vec2vec: Scaling Universal Geometry Alignment with Linear Transformations
NLP

The paper introduces mini-vec2vec, an efficient method for aligning text embedding spaces using linear transformations, significantly imp...

arXiv - AI · 3 min ·
[2510.01143] Generalized Parallel Scaling with Interdependent Generations
LLMs

The paper presents a novel approach, Bridge, for parallel scaling in LLM inference that generates interdependent responses, enhancing acc...

arXiv - Machine Learning · 3 min ·
[2510.00565] Toward Safer Diffusion Language Models: Discovery and Mitigation of Priming Vulnerability
LLMs

This paper explores vulnerabilities in diffusion language models (DLMs) related to priming attacks and proposes a novel safety alignment ...

arXiv - Machine Learning · 4 min ·
[2509.14461] Learning depth-3 circuits via quantum agnostic boosting
AI Infrastructure

This article introduces quantum agnostic learning protocols for depth-3 circuits, showcasing a quantum agnostic boosting method that enha...

arXiv - Machine Learning · 4 min ·
[2602.15721] Lifelong Scalable Multi-Agent Realistic Testbed and A Comprehensive Study on Design Choices in Lifelong AGV Fleet Management Systems
AI Agents

The paper presents LSMART, an open-source simulator for evaluating Multi-Agent Path Finding (MAPF) algorithms in Automated Guided Vehicle...

arXiv - AI · 4 min ·
[2507.01110] A LoD of Gaussians: Unified Training and Rendering for Ultra-Large Scale Reconstruction with External Memory
Machine Learning

The paper presents a novel framework, A LoD of Gaussians, for ultra-large-scale scene reconstruction and rendering using Gaussian splatti...

arXiv - Machine Learning · 4 min ·
[2410.11855] Online GPU Energy Optimization with Switching-Aware Bandits
Machine Learning

This paper presents EnergyUCB, a novel online GPU energy optimization method using a multi-armed bandit approach to balance performance a...

arXiv - Machine Learning · 4 min ·
[2602.15564] Beyond Static Pipelines: Learning Dynamic Workflows for Text-to-SQL
Machine Learning

The paper presents a novel approach to Text-to-SQL systems by introducing dynamic workflows that adapt during inference, enhancing perfor...

arXiv - AI · 3 min ·
[2405.20178] Non-intrusive data-driven model order reduction for circuits based on Hammerstein architectures
Machine Learning

This paper presents a non-intrusive data-driven model order reduction method for circuits using Hammerstein architectures, demonstrating ...

arXiv - Machine Learning · 4 min ·
[2602.15549] VLM-DEWM: Dynamic External World Model for Verifiable and Resilient Vision-Language Planning in Manufacturing
LLMs

The paper introduces VLM-DEWM, a novel cognitive architecture designed to enhance vision-language planning in manufacturing by addressing...

arXiv - AI · 4 min ·
[2602.08032] Horizon Imagination: Efficient On-Policy Rollout in Diffusion World Models
Machine Learning

The paper presents Horizon Imagination (HI), an innovative on-policy imagination process for reinforcement learning using diffusion-based...

arXiv - Machine Learning · 3 min ·