AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · 26 minutes ago

Ai Infrastructure

Mythos SI (Structured Intelligence): Technical Evidence, Coordinated Criticism, and What the Pattern Actually Shows

Perplexity just ran a structural analysis on the criticism campaign against my work. What it found: synchronized language across posts, n...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Llms

LLM Guard scored 0/8 detecting a Crescendo multi-turn attack. Arc Sentry flagged it at Turn 3.

Crescendo (Russinovich et al., USENIX Security 2025) is a multi-turn jailbreak that starts with innocent questions and gradually steers a...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

All Content

Llms

[2602.02660] MARS: Modular Agent with Reflective Search for Automated AI Research

The paper introduces MARS, a Modular Agent designed for automated AI research, emphasizing cost-aware planning and reflective memory to e...

arXiv - AI · 3 min · about 2 months ago

Llms

[2509.21199] A Fano-Style Accuracy Upper Bound for LLM Single-Pass Reasoning in Multi-Hop QA

This paper presents a theoretical framework establishing a Fano-style accuracy upper bound for single-pass reasoning in multi-hop questio...

arXiv - AI · 4 min · about 2 months ago

Ai Infrastructure

[2509.07997] Learning-Based Planning for Improving Science Return of Earth Observation Satellites

The paper presents learning-based approaches to dynamic targeting for Earth observation satellites, demonstrating improved scientific dat...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2508.00576] MultiSHAP: A Shapley-Based Framework for Explaining Cross-Modal Interactions in Multimodal AI Models

MultiSHAP introduces a Shapley-based framework for explaining interactions in multimodal AI models, enhancing interpretability and trustw...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.11325] Amortised and provably-robust simulation-based inference

This paper presents a novel method for simulation-based inference that is robust to outliers and simplifies computation by eliminating th...

arXiv - Machine Learning · 3 min · about 2 months ago

Ai Infrastructure

[2507.06134] OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety

OpenAgentSafety introduces a modular framework for evaluating AI agent safety in real-world tasks, addressing critical vulnerabilities in...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.01872] Grappa: Gradient-Only Communication for Scalable Graph Neural Network Training

Grappa introduces a gradient-only communication framework for scalable training of Graph Neural Networks (GNNs), improving speed and accu...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.01664] FlowSteer: Interactive Agentic Workflow Orchestration via End-to-End Reinforcement Learning

FlowSteer introduces an end-to-end reinforcement learning framework for automating workflow orchestration, addressing challenges like man...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.15811] Task-Agnostic Continual Learning for Chest Radiograph Classification

This article presents CARL-XRay, a novel continual learning framework for chest radiograph classification that adapts to new datasets wit...

arXiv - AI · 4 min · about 2 months ago

Nlp

[2510.02348] mini-vec2vec: Scaling Universal Geometry Alignment with Linear Transformations

The paper introduces mini-vec2vec, an efficient method for aligning text embedding spaces using linear transformations, significantly imp...

arXiv - AI · 3 min · about 2 months ago

Llms

[2510.01143] Generalized Parallel Scaling with Interdependent Generations

The paper presents a novel approach, Bridge, for parallel scaling in LLM inference that generates interdependent responses, enhancing acc...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2510.00565] Toward Safer Diffusion Language Models: Discovery and Mitigation of Priming Vulnerability

This paper explores vulnerabilities in diffusion language models (DLMs) related to priming attacks and proposes a novel safety alignment ...

arXiv - Machine Learning · 4 min · about 2 months ago

Ai Infrastructure

[2509.14461] Learning depth-3 circuits via quantum agnostic boosting

This article introduces quantum agnostic learning protocols for depth-3 circuits, showcasing a quantum agnostic boosting method that enha...

arXiv - Machine Learning · 4 min · about 2 months ago

Ai Agents

[2602.15721] Lifelong Scalable Multi-Agent Realistic Testbed and A Comprehensive Study on Design Choices in Lifelong AGV Fleet Management Systems

The paper presents LSMART, an open-source simulator for evaluating Multi-Agent Path Finding (MAPF) algorithms in Automated Guided Vehicle...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2507.01110] A LoD of Gaussians: Unified Training and Rendering for Ultra-Large Scale Reconstruction with External Memory

The paper presents a novel framework, A LoD of Gaussians, for ultra-large-scale scene reconstruction and rendering using Gaussian splatti...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2410.11855] Online GPU Energy Optimization with Switching-Aware Bandits

This paper presents EnergyUCB, a novel online GPU energy optimization method using a multi-armed bandit approach to balance performance a...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.15564] Beyond Static Pipelines: Learning Dynamic Workflows for Text-to-SQL

The paper presents a novel approach to Text-to-SQL systems by introducing dynamic workflows that adapt during inference, enhancing perfor...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2405.20178] Non-intrusive data-driven model order reduction for circuits based on Hammerstein architectures

This paper presents a non-intrusive data-driven model order reduction method for circuits using Hammerstein architectures, demonstrating ...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.15549] VLM-DEWM: Dynamic External World Model for Verifiable and Resilient Vision-Language Planning in Manufacturing

The paper introduces VLM-DEWM, a novel cognitive architecture designed to enhance vision-language planning in manufacturing by addressing...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.08032] Horizon Imagination: Efficient On-Policy Rollout in Diffusion World Models

The paper presents Horizon Imagination (HI), an innovative on-policy imagination process for reinforcement learning using diffusion-based...

arXiv - Machine Learning · 3 min · about 2 months ago

Previous Page 147 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence

Mythos SI (Structured Intelligence): Technical Evidence, Coordinated Criticism, and What the Pattern Actually Shows

LLM Guard scored 0/8 detecting a Crescendo multi-turn attack. Arc Sentry flagged it at Turn 3.

All Content

[2602.02660] MARS: Modular Agent with Reflective Search for Automated AI Research

[2509.21199] A Fano-Style Accuracy Upper Bound for LLM Single-Pass Reasoning in Multi-Hop QA

[2509.07997] Learning-Based Planning for Improving Science Return of Earth Observation Satellites

[2508.00576] MultiSHAP: A Shapley-Based Framework for Explaining Cross-Modal Interactions in Multimodal AI Models

[2602.11325] Amortised and provably-robust simulation-based inference

[2507.06134] OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety

[2602.01872] Grappa: Gradient-Only Communication for Scalable Graph Neural Network Training

[2602.01664] FlowSteer: Interactive Agentic Workflow Orchestration via End-to-End Reinforcement Learning

[2602.15811] Task-Agnostic Continual Learning for Chest Radiograph Classification

[2510.02348] mini-vec2vec: Scaling Universal Geometry Alignment with Linear Transformations

[2510.01143] Generalized Parallel Scaling with Interdependent Generations

[2510.00565] Toward Safer Diffusion Language Models: Discovery and Mitigation of Priming Vulnerability

[2509.14461] Learning depth-3 circuits via quantum agnostic boosting

[2602.15721] Lifelong Scalable Multi-Agent Realistic Testbed and A Comprehensive Study on Design Choices in Lifelong AGV Fleet Management Systems

[2507.01110] A LoD of Gaussians: Unified Training and Rendering for Ultra-Large Scale Reconstruction with External Memory

[2410.11855] Online GPU Energy Optimization with Switching-Aware Bandits

[2602.15564] Beyond Static Pipelines: Learning Dynamic Workflows for Text-to-SQL

[2405.20178] Non-intrusive data-driven model order reduction for circuits based on Hammerstein architectures

[2602.15549] VLM-DEWM: Dynamic External World Model for Verifiable and Resilient Vision-Language Planning in Manufacturing

[2602.08032] Horizon Imagination: Efficient On-Policy Rollout in Diffusion World Models

Related Topics

Stay updated with AI News