AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

Machine Learning

Is "live AI video generation" a meaningful technical category or just a marketing term? [R]

Asking from a technical standpoint because I feel like the term is doing a lot of work in coverage of this space right now. Genuine real-...

Reddit - Machine Learning · 1 min ·
AI Infrastructure

FlashAttention (FA1–FA4) in PyTorch - educational implementations focused on algorithmic differences [P]

I recently updated my FlashAttention-PyTorch repo so it now includes educational implementations of FA1, FA2, FA3, and FA4 in plain PyTor...

Reddit - Machine Learning · 1 min ·
AI Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·

All Content

LLMs

[2509.22211] LogiPart: Local Large Language Models for Data Exploration at Scale with Logical Partitioning

LogiPart introduces a scalable framework for data exploration using local large language models, enhancing the efficiency of taxonomic di...

arXiv - AI · 4 min ·
LLMs

[2504.06438] Don't Let It Hallucinate: Premise Verification via Retrieval-Augmented Logical Reasoning

The paper presents a novel framework for premise verification in large language models (LLMs) to reduce hallucinations by using retrieval...

arXiv - AI · 4 min ·
LLMs

[2408.00539] Intermittent Semi-Working Mask: A New Masking Paradigm for LLMs

The paper introduces the Intermittent Semi-Working Mask (ISM), a novel masking paradigm for Large Language Models (LLMs) that enhances mu...

arXiv - AI · 4 min ·
AI Infrastructure

[2309.08615] Energy Concerns with HPC Systems and Applications

The paper discusses the critical energy concerns associated with High-Performance Computing (HPC) systems and applications, emphasizing t...

arXiv - AI · 4 min ·
Machine Learning

[2602.12249] "Sorry, I Didn't Catch That": How Speech Models Miss What Matters Most

This paper examines the shortcomings of speech recognition models in accurately transcribing high-stakes utterances, particularly U.S. st...

arXiv - AI · 4 min ·
LLMs

[2602.02660] MARS: Modular Agent with Reflective Search for Automated AI Research

The paper introduces MARS, a Modular Agent designed for automated AI research, emphasizing cost-aware planning and reflective memory to e...

arXiv - AI · 3 min ·
LLMs

[2509.21199] A Fano-Style Accuracy Upper Bound for LLM Single-Pass Reasoning in Multi-Hop QA

This paper presents a theoretical framework establishing a Fano-style accuracy upper bound for single-pass reasoning in multi-hop questio...

arXiv - AI · 4 min ·
AI Infrastructure

[2509.07997] Learning-Based Planning for Improving Science Return of Earth Observation Satellites

The paper presents learning-based approaches to dynamic targeting for Earth observation satellites, demonstrating improved scientific dat...

arXiv - AI · 4 min ·
Machine Learning

[2508.00576] MultiSHAP: A Shapley-Based Framework for Explaining Cross-Modal Interactions in Multimodal AI Models

MultiSHAP introduces a Shapley-based framework for explaining interactions in multimodal AI models, enhancing interpretability and trustw...

arXiv - AI · 4 min ·
Machine Learning

[2602.11325] Amortised and provably-robust simulation-based inference

This paper presents a novel method for simulation-based inference that is robust to outliers and simplifies computation by eliminating th...

arXiv - Machine Learning · 3 min ·
AI Infrastructure

[2507.06134] OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety

OpenAgentSafety introduces a modular framework for evaluating AI agent safety in real-world tasks, addressing critical vulnerabilities in...

arXiv - AI · 4 min ·
Machine Learning

[2602.01872] Grappa: Gradient-Only Communication for Scalable Graph Neural Network Training

Grappa introduces a gradient-only communication framework for scalable training of Graph Neural Networks (GNNs), improving speed and accu...

arXiv - Machine Learning · 4 min ·
LLMs

[2602.01664] FlowSteer: Interactive Agentic Workflow Orchestration via End-to-End Reinforcement Learning

FlowSteer introduces an end-to-end reinforcement learning framework for automating workflow orchestration, addressing challenges like man...

arXiv - Machine Learning · 4 min ·
Machine Learning

[2602.15811] Task-Agnostic Continual Learning for Chest Radiograph Classification

This article presents CARL-XRay, a novel continual learning framework for chest radiograph classification that adapts to new datasets wit...

arXiv - AI · 4 min ·
NLP

[2510.02348] mini-vec2vec: Scaling Universal Geometry Alignment with Linear Transformations

The paper introduces mini-vec2vec, an efficient method for aligning text embedding spaces using linear transformations, significantly imp...

arXiv - AI · 3 min ·
LLMs

[2510.01143] Generalized Parallel Scaling with Interdependent Generations

The paper presents a novel approach, Bridge, for parallel scaling in LLM inference that generates interdependent responses, enhancing acc...

arXiv - Machine Learning · 3 min ·
LLMs

[2510.00565] Toward Safer Diffusion Language Models: Discovery and Mitigation of Priming Vulnerability

This paper explores vulnerabilities in diffusion language models (DLMs) related to priming attacks and proposes a novel safety alignment ...

arXiv - Machine Learning · 4 min ·
AI Infrastructure

[2509.14461] Learning depth-3 circuits via quantum agnostic boosting

This article introduces quantum agnostic learning protocols for depth-3 circuits, showcasing a quantum agnostic boosting method that enha...

arXiv - Machine Learning · 4 min ·
AI Agents

[2602.15721] Lifelong Scalable Multi-Agent Realistic Testbed and A Comprehensive Study on Design Choices in Lifelong AGV Fleet Management Systems

The paper presents LSMART, an open-source simulator for evaluating Multi-Agent Path Finding (MAPF) algorithms in Automated Guided Vehicle...

arXiv - AI · 4 min ·
Machine Learning

[2507.01110] A LoD of Gaussians: Unified Training and Rendering for Ultra-Large Scale Reconstruction with External Memory

The paper presents a novel framework, A LoD of Gaussians, for ultra-large-scale scene reconstruction and rendering using Gaussian splatti...

arXiv - Machine Learning · 4 min ·