AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Llms

What is the current landscape on AI agents knowledge

Recently used "free" rates codex to give me a quick fastapi project sample. It gave me deprecated (a)app.on_event("startup). What are you...

Reddit - Artificial Intelligence · 1 min ·
NVIDIA Isaac GR00T N1.7: Open Reasoning VLA Model for Humanoid Robots
Open Source Ai

NVIDIA Isaac GR00T N1.7: Open Reasoning VLA Model for Humanoid Robots

A Blog post by NVIDIA on Hugging Face

Hugging Face Blog · 4 min ·

All Content

[2602.05319] Accelerated Sequential Flow Matching: A Bayesian Filtering Perspective
Machine Learning

[2602.05319] Accelerated Sequential Flow Matching: A Bayesian Filtering Perspective

This paper introduces Accelerated Sequential Flow Matching, a Bayesian filtering framework that enhances real-time inference in stochasti...

arXiv - Machine Learning · 4 min ·
[2602.04942] Privileged Information Distillation for Language Models
Llms

[2602.04942] Privileged Information Distillation for Language Models

This paper presents methods for distilling privileged information in language models, focusing on improving performance in multi-turn env...

arXiv - AI · 4 min ·
[2506.02634] KVCache Cache in the Wild: Characterizing and Optimizing KVCache Cache at a Large Cloud Provider
Llms

[2506.02634] KVCache Cache in the Wild: Characterizing and Optimizing KVCache Cache at a Large Cloud Provider

This paper characterizes and optimizes KVCache, a caching mechanism for large language model (LLM) serving at a major cloud provider, hig...

arXiv - AI · 4 min ·
[2602.03546] How to Train Your Resistive Network: Generalized Equilibrium Propagation and Analytical Learning
Machine Learning

[2602.03546] How to Train Your Resistive Network: Generalized Equilibrium Propagation and Analytical Learning

This paper presents a novel algorithm for training resistive networks using Generalized Equilibrium Propagation, aiming to enhance energy...

arXiv - Machine Learning · 4 min ·
[2602.02201] Cardinality-Preserving Attention Channels for Graph Transformers in Molecular Property Prediction
Machine Learning

[2602.02201] Cardinality-Preserving Attention Channels for Graph Transformers in Molecular Property Prediction

This article presents a novel graph transformer model, incorporating cardinality-preserving attention channels, to enhance molecular prop...

arXiv - Machine Learning · 3 min ·
[2602.01051] SwiftRepertoire: Few-Shot Immune-Signature Synthesis via Dynamic Kernel Codes
Ai Infrastructure

[2602.01051] SwiftRepertoire: Few-Shot Immune-Signature Synthesis via Dynamic Kernel Codes

The paper presents SwiftRepertoire, a framework for synthesizing immune signatures using few-shot learning techniques, enabling efficient...

arXiv - Machine Learning · 4 min ·
[2505.07861] Scalable LLM Reasoning Acceleration with Low-rank Distillation
Llms

[2505.07861] Scalable LLM Reasoning Acceleration with Low-rank Distillation

The paper presents Caprese, a low-rank distillation method designed to enhance reasoning capabilities in large language models (LLMs) whi...

arXiv - Machine Learning · 3 min ·
[2601.22323] Models Under SCOPE: Scalable and Controllable Routing via Pre-hoc Reasoning
Llms

[2601.22323] Models Under SCOPE: Scalable and Controllable Routing via Pre-hoc Reasoning

The paper presents SCOPE, a novel routing framework for language models that dynamically predicts cost and performance, enhancing efficie...

arXiv - Machine Learning · 4 min ·
[2505.07755] Benchmarking of CPU-intensive Stream Data Processing in The Edge Computing Systems
Ai Infrastructure

[2505.07755] Benchmarking of CPU-intensive Stream Data Processing in The Edge Computing Systems

This article evaluates CPU-intensive stream data processing in edge computing systems, highlighting performance and power consumption opt...

arXiv - AI · 4 min ·
[2601.18702] From Fuzzy to Exact: The Halo Architecture for Infinite-Depth Reasoning via Rational Arithmetic
Llms

[2601.18702] From Fuzzy to Exact: The Halo Architecture for Infinite-Depth Reasoning via Rational Arithmetic

This paper introduces the Halo Architecture, a new framework for infinite-depth reasoning using rational arithmetic, aiming to enhance th...

arXiv - AI · 4 min ·
[2601.03213] Critic-Guided Reinforcement Unlearning in Text-to-Image Diffusion
Machine Learning

[2601.03213] Critic-Guided Reinforcement Unlearning in Text-to-Image Diffusion

The paper presents a novel reinforcement learning framework for unlearning targeted concepts in text-to-image diffusion models, enhancing...

arXiv - Machine Learning · 4 min ·
[2512.20885] From GNNs to Symbolic Surrogates via Kolmogorov-Arnold Networks for Delay Prediction
Machine Learning

[2512.20885] From GNNs to Symbolic Surrogates via Kolmogorov-Arnold Networks for Delay Prediction

This paper explores the use of Kolmogorov-Arnold Networks (KAN) for predicting flow delays in communication networks, enhancing efficienc...

arXiv - Machine Learning · 3 min ·
[2511.17879] Generative Adversarial Post-Training Mitigates Reward Hacking in Live Human-AI Music Interaction
Machine Learning

[2511.17879] Generative Adversarial Post-Training Mitigates Reward Hacking in Live Human-AI Music Interaction

This paper presents a novel method using generative adversarial training to address reward hacking in real-time human-AI music interactio...

arXiv - Machine Learning · 4 min ·
[2511.16652] Evolution Strategies at the Hyperscale
Ai Infrastructure

[2511.16652] Evolution Strategies at the Hyperscale

The paper presents EGGROLL, an enhanced Evolution Strategy for optimizing large-scale models, achieving significant speed improvements an...

arXiv - AI · 4 min ·
[2408.10746] Resource-Efficient Personal Large Language Models Fine-Tuning with Collaborative Edge Computing
Llms

[2408.10746] Resource-Efficient Personal Large Language Models Fine-Tuning with Collaborative Edge Computing

The paper presents PAC, a collaborative edge computing framework designed for resource-efficient fine-tuning of personal large language m...

arXiv - Machine Learning · 4 min ·
[2510.15987] Algorithmic Primitives and Compositional Geometry of Reasoning in Language Models
Llms

[2510.15987] Algorithmic Primitives and Compositional Geometry of Reasoning in Language Models

The paper explores how algorithmic primitives and compositional geometry can enhance reasoning capabilities in large language models (LLM...

arXiv - AI · 4 min ·
[2510.13654] Challenges and Requirements for Benchmarking Time Series Foundation Models
Llms

[2510.13654] Challenges and Requirements for Benchmarking Time Series Foundation Models

This article discusses the challenges and requirements for benchmarking Time Series Foundation Models (TSFMs), highlighting issues of inf...

arXiv - Machine Learning · 3 min ·
[2406.04955] Experimental Evaluation of ROS-Causal in Real-World Human-Robot Spatial Interaction Scenarios
Machine Learning

[2406.04955] Experimental Evaluation of ROS-Causal in Real-World Human-Robot Spatial Interaction Scenarios

This article presents an experimental evaluation of ROS-Causal, a framework for causal discovery in human-robot spatial interactions, dem...

arXiv - AI · 4 min ·
[2510.07182] Bridged Clustering: Semi-Supervised Sparse Bridging
Machine Learning

[2510.07182] Bridged Clustering: Semi-Supervised Sparse Bridging

The paper introduces Bridged Clustering, a semi-supervised framework that learns predictors from unpaired datasets by clustering inputs a...

arXiv - Machine Learning · 3 min ·
[2404.08634] When Attention Collapses: How Degenerate Layers in LLMs Enable Smaller, Stronger Models
Llms

[2404.08634] When Attention Collapses: How Degenerate Layers in LLMs Enable Smaller, Stronger Models

This article explores the phenomenon of 'attention collapse' in large language models (LLMs) and introduces Inheritune, a method for crea...

arXiv - Machine Learning · 4 min ·
Previous Page 166 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime