AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

Machine Learning

Is "live AI video generation" a meaningful technical category or just a marketing term? [R]

Asking from a technical standpoint because I feel like the term is doing a lot of work in coverage of this space right now. Genuine real-...

Reddit - Machine Learning · 1 min ·
Ai Infrastructure

FlashAttention (FA1–FA4) in PyTorch - educational implementations focused on algorithmic differences [P]

I recently updated my FlashAttention-PyTorch repo so it now includes educational implementations of FA1, FA2, FA3, and FA4 in plain PyTor...

Reddit - Machine Learning · 1 min ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·

All Content

[2411.18954] NeuroLifting: Neural Inference on Markov Random Fields at Scale
Machine Learning

[2411.18954] NeuroLifting: Neural Inference on Markov Random Fields at Scale

NeuroLifting introduces a novel approach for inference in large-scale Markov Random Fields (MRFs) using Graph Neural Networks, achieving ...

arXiv - AI · 4 min ·
[2602.15197] OpaqueToolsBench: Learning Nuances of Tool Behavior Through Interaction
Llms

[2602.15197] OpaqueToolsBench: Learning Nuances of Tool Behavior Through Interaction

The paper introduces OpaqueToolsBench, a benchmark for evaluating Large Language Model (LLM) agents' performance with opaque tools, propo...

arXiv - AI · 3 min ·
[2602.15756] A Note on Non-Composability of Layerwise Approximate Verification for Neural Inference
Machine Learning

[2602.15756] A Note on Non-Composability of Layerwise Approximate Verification for Neural Inference

This paper discusses the limitations of layerwise approximate verification in neural inference, presenting a counterexample that challeng...

arXiv - Machine Learning · 3 min ·
[2602.15751] Enabling Low-Latency Machine learning on Radiation-Hard FPGAs with hls4ml
Machine Learning

[2602.15751] Enabling Low-Latency Machine learning on Radiation-Hard FPGAs with hls4ml

This article presents a novel approach to implementing low-latency machine learning on radiation-hard FPGAs, demonstrating its applicatio...

arXiv - Machine Learning · 4 min ·
[2602.15707] Proactive Conversational Assistant for a Procedural Manual Task based on Audio and IMU
Ai Infrastructure

[2602.15707] Proactive Conversational Assistant for a Procedural Manual Task based on Audio and IMU

This article presents a novel real-time conversational assistant that utilizes audio and IMU data to guide users through procedural tasks...

arXiv - Machine Learning · 4 min ·
[2602.15061] Safe-SDL:Establishing Safety Boundaries and Control Mechanisms for AI-Driven Self-Driving Laboratories
Robotics

[2602.15061] Safe-SDL:Establishing Safety Boundaries and Control Mechanisms for AI-Driven Self-Driving Laboratories

The paper presents Safe-SDL, a framework for ensuring safety in AI-driven Self-Driving Laboratories, addressing the critical 'Syntax-to-S...

arXiv - AI · 4 min ·
[2602.15055] Beyond Context Sharing: A Unified Agent Communication Protocol (ACP) for Secure, Federated, and Autonomous Agent-to-Agent (A2A) Orchestration
Llms

[2602.15055] Beyond Context Sharing: A Unified Agent Communication Protocol (ACP) for Secure, Federated, and Autonomous Agent-to-Agent (A2A) Orchestration

The paper introduces the Agent Communication Protocol (ACP), a framework for secure and efficient agent-to-agent orchestration, addressin...

arXiv - AI · 3 min ·
[2602.15521] ExpertWeaver: Unlocking the Inherent MoE in Dense LLMs with GLU Activation Patterns
Llms

[2602.15521] ExpertWeaver: Unlocking the Inherent MoE in Dense LLMs with GLU Activation Patterns

The paper presents ExpertWeaver, a framework that enhances the conversion of dense LLMs into sparse Mixture-of-Experts (MoE) models using...

arXiv - Machine Learning · 4 min ·
[2602.15472] Fluids You Can Trust: Property-Preserving Operator Learning for Incompressible Flows
Machine Learning

[2602.15472] Fluids You Can Trust: Property-Preserving Operator Learning for Incompressible Flows

This article introduces a novel operator learning method for incompressible flows, enhancing computational efficiency while preserving es...

arXiv - Machine Learning · 3 min ·
[2602.15036] Transforming Computational Lithography with AC and AI -- Faster, More Accurate, and Energy-efficient
Machine Learning

[2602.15036] Transforming Computational Lithography with AC and AI -- Faster, More Accurate, and Energy-efficient

This article discusses the integration of accelerated computing (AC) and artificial intelligence (AI) in computational lithography, highl...

arXiv - AI · 4 min ·
[2602.15470] The Skeletal Trap: Mapping Spatial Inequality and Ghost Stops in Ankara's Transit Network
Nlp

[2602.15470] The Skeletal Trap: Mapping Spatial Inequality and Ghost Stops in Ankara's Transit Network

This article explores Ankara's public transport crisis, attributing it to structural issues rather than mere inefficiencies. It highlight...

arXiv - Machine Learning · 3 min ·
[2602.15423] GaiaFlow: Semantic-Guided Diffusion Tuning for Carbon-Frugal Search
Machine Learning

[2602.15423] GaiaFlow: Semantic-Guided Diffusion Tuning for Carbon-Frugal Search

GaiaFlow presents a novel framework for carbon-efficient search, employing semantic-guided diffusion tuning to balance retrieval accuracy...

arXiv - Machine Learning · 3 min ·
[2602.15785] This human study did not involve human subjects: Validating LLM simulations as behavioral evidence
Llms

[2602.15785] This human study did not involve human subjects: Validating LLM simulations as behavioral evidence

This article discusses the use of large language models (LLMs) as synthetic participants in social science experiments, evaluating their ...

arXiv - AI · 4 min ·
[2602.15379] FlashMem: Supporting Modern DNN Workloads on Mobile with GPU Memory Hierarchy Optimizations
Machine Learning

[2602.15379] FlashMem: Supporting Modern DNN Workloads on Mobile with GPU Memory Hierarchy Optimizations

The paper presents FlashMem, a memory streaming framework designed to optimize the execution of large-scale deep neural networks (DNNs) o...

arXiv - Machine Learning · 4 min ·
[2602.15669] PERSONA: Dynamic and Compositional Inference-Time Personality Control via Activation Vector Algebra
Llms

[2602.15669] PERSONA: Dynamic and Compositional Inference-Time Personality Control via Activation Vector Algebra

The paper introduces PERSONA, a novel framework for dynamic personality control in Large Language Models (LLMs) using activation vector a...

arXiv - AI · 4 min ·
[2602.15326] SCENE OTA-FD: Self-Centering Noncoherent Estimator for Over-the-Air Federated Distillation
Ai Safety

[2602.15326] SCENE OTA-FD: Self-Centering Noncoherent Estimator for Over-the-Air Federated Distillation

The paper presents SCENE, a novel estimator for over-the-air federated distillation that enhances aggregation without requiring pilot sig...

arXiv - Machine Learning · 3 min ·
[2602.15277] Accelerating Large-Scale Dataset Distillation via Exploration-Exploitation Optimization
Machine Learning

[2602.15277] Accelerating Large-Scale Dataset Distillation via Exploration-Exploitation Optimization

This paper presents Exploration-Exploitation Distillation (E^2D), a method for efficient large-scale dataset distillation that balances a...

arXiv - Machine Learning · 4 min ·
[2602.15391] Improving LLM Reliability through Hybrid Abstention and Adaptive Detection
Llms

[2602.15391] Improving LLM Reliability through Hybrid Abstention and Adaptive Detection

The paper presents a novel adaptive abstention system for Large Language Models (LLMs) that balances safety and utility by dynamically ad...

arXiv - AI · 4 min ·
[2602.15161] Exploiting Layer-Specific Vulnerabilities to Backdoor Attack in Federated Learning
Machine Learning

[2602.15161] Exploiting Layer-Specific Vulnerabilities to Backdoor Attack in Federated Learning

This paper presents the Layer Smoothing Attack (LSA), a novel backdoor attack exploiting layer-specific vulnerabilities in federated lear...

arXiv - Machine Learning · 4 min ·
[2602.15136] Universal priors: solving empirical Bayes via Bayesian inference and pretraining
Machine Learning

[2602.15136] Universal priors: solving empirical Bayes via Bayesian inference and pretraining

The paper explores how a pretrained transformer can effectively solve empirical Bayes problems by leveraging universal priors, demonstrat...

arXiv - Machine Learning · 3 min ·
Previous Page 137 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime