AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 3 hours ago

Llms

What is the current landscape on AI agents knowledge

Recently used "free" rates codex to give me a quick fastapi project sample. It gave me deprecated (a)app.on_event("startup). What are you...

Reddit - Artificial Intelligence · 1 min · about 14 hours ago

Open Source Ai

NVIDIA Isaac GR00T N1.7: Open Reasoning VLA Model for Humanoid Robots

A Blog post by NVIDIA on Hugging Face

Hugging Face Blog · 4 min · about 19 hours ago

All Content

Machine Learning

[2602.05319] Accelerated Sequential Flow Matching: A Bayesian Filtering Perspective

This paper introduces Accelerated Sequential Flow Matching, a Bayesian filtering framework that enhances real-time inference in stochasti...

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2602.04942] Privileged Information Distillation for Language Models

This paper presents methods for distilling privileged information in language models, focusing on improving performance in multi-turn env...

arXiv - AI · 4 min · 2 months ago

Llms

[2506.02634] KVCache Cache in the Wild: Characterizing and Optimizing KVCache Cache at a Large Cloud Provider

This paper characterizes and optimizes KVCache, a caching mechanism for large language model (LLM) serving at a major cloud provider, hig...

arXiv - AI · 4 min · 2 months ago

Machine Learning

[2602.03546] How to Train Your Resistive Network: Generalized Equilibrium Propagation and Analytical Learning

This paper presents a novel algorithm for training resistive networks using Generalized Equilibrium Propagation, aiming to enhance energy...

arXiv - Machine Learning · 4 min · 2 months ago

Machine Learning

[2602.02201] Cardinality-Preserving Attention Channels for Graph Transformers in Molecular Property Prediction

This article presents a novel graph transformer model, incorporating cardinality-preserving attention channels, to enhance molecular prop...

arXiv - Machine Learning · 3 min · 2 months ago

Ai Infrastructure

[2602.01051] SwiftRepertoire: Few-Shot Immune-Signature Synthesis via Dynamic Kernel Codes

The paper presents SwiftRepertoire, a framework for synthesizing immune signatures using few-shot learning techniques, enabling efficient...

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2505.07861] Scalable LLM Reasoning Acceleration with Low-rank Distillation

The paper presents Caprese, a low-rank distillation method designed to enhance reasoning capabilities in large language models (LLMs) whi...

arXiv - Machine Learning · 3 min · 2 months ago

Llms

[2601.22323] Models Under SCOPE: Scalable and Controllable Routing via Pre-hoc Reasoning

The paper presents SCOPE, a novel routing framework for language models that dynamically predicts cost and performance, enhancing efficie...

arXiv - Machine Learning · 4 min · 2 months ago

Ai Infrastructure

[2505.07755] Benchmarking of CPU-intensive Stream Data Processing in The Edge Computing Systems

This article evaluates CPU-intensive stream data processing in edge computing systems, highlighting performance and power consumption opt...

arXiv - AI · 4 min · 2 months ago

Llms

[2601.18702] From Fuzzy to Exact: The Halo Architecture for Infinite-Depth Reasoning via Rational Arithmetic

This paper introduces the Halo Architecture, a new framework for infinite-depth reasoning using rational arithmetic, aiming to enhance th...

arXiv - AI · 4 min · 2 months ago

Machine Learning

[2601.03213] Critic-Guided Reinforcement Unlearning in Text-to-Image Diffusion

The paper presents a novel reinforcement learning framework for unlearning targeted concepts in text-to-image diffusion models, enhancing...

arXiv - Machine Learning · 4 min · 2 months ago

Machine Learning

[2512.20885] From GNNs to Symbolic Surrogates via Kolmogorov-Arnold Networks for Delay Prediction

This paper explores the use of Kolmogorov-Arnold Networks (KAN) for predicting flow delays in communication networks, enhancing efficienc...

arXiv - Machine Learning · 3 min · 2 months ago

Machine Learning

[2511.17879] Generative Adversarial Post-Training Mitigates Reward Hacking in Live Human-AI Music Interaction

This paper presents a novel method using generative adversarial training to address reward hacking in real-time human-AI music interactio...

arXiv - Machine Learning · 4 min · 2 months ago

Ai Infrastructure

[2511.16652] Evolution Strategies at the Hyperscale

The paper presents EGGROLL, an enhanced Evolution Strategy for optimizing large-scale models, achieving significant speed improvements an...

arXiv - AI · 4 min · 2 months ago

Llms

[2408.10746] Resource-Efficient Personal Large Language Models Fine-Tuning with Collaborative Edge Computing

The paper presents PAC, a collaborative edge computing framework designed for resource-efficient fine-tuning of personal large language m...

arXiv - Machine Learning · 4 min · 2 months ago

Llms

[2510.15987] Algorithmic Primitives and Compositional Geometry of Reasoning in Language Models

The paper explores how algorithmic primitives and compositional geometry can enhance reasoning capabilities in large language models (LLM...

arXiv - AI · 4 min · 2 months ago

Llms

[2510.13654] Challenges and Requirements for Benchmarking Time Series Foundation Models

This article discusses the challenges and requirements for benchmarking Time Series Foundation Models (TSFMs), highlighting issues of inf...

arXiv - Machine Learning · 3 min · 2 months ago

Machine Learning

[2406.04955] Experimental Evaluation of ROS-Causal in Real-World Human-Robot Spatial Interaction Scenarios

This article presents an experimental evaluation of ROS-Causal, a framework for causal discovery in human-robot spatial interactions, dem...

arXiv - AI · 4 min · 2 months ago

Machine Learning

[2510.07182] Bridged Clustering: Semi-Supervised Sparse Bridging

The paper introduces Bridged Clustering, a semi-supervised framework that learns predictors from unpaired datasets by clustering inputs a...

arXiv - Machine Learning · 3 min · 2 months ago

Llms

[2404.08634] When Attention Collapses: How Degenerate Layers in LLMs Enable Smaller, Stronger Models

This article explores the phenomenon of 'attention collapse' in large language models (LLMs) and introduces Inheritune, a method for crea...

arXiv - Machine Learning · 4 min · 2 months ago

Previous Page 166 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence

What is the current landscape on AI agents knowledge

NVIDIA Isaac GR00T N1.7: Open Reasoning VLA Model for Humanoid Robots

All Content

[2602.05319] Accelerated Sequential Flow Matching: A Bayesian Filtering Perspective

[2602.04942] Privileged Information Distillation for Language Models

[2506.02634] KVCache Cache in the Wild: Characterizing and Optimizing KVCache Cache at a Large Cloud Provider

[2602.03546] How to Train Your Resistive Network: Generalized Equilibrium Propagation and Analytical Learning

[2602.02201] Cardinality-Preserving Attention Channels for Graph Transformers in Molecular Property Prediction

[2602.01051] SwiftRepertoire: Few-Shot Immune-Signature Synthesis via Dynamic Kernel Codes

[2505.07861] Scalable LLM Reasoning Acceleration with Low-rank Distillation

[2601.22323] Models Under SCOPE: Scalable and Controllable Routing via Pre-hoc Reasoning

[2505.07755] Benchmarking of CPU-intensive Stream Data Processing in The Edge Computing Systems

[2601.18702] From Fuzzy to Exact: The Halo Architecture for Infinite-Depth Reasoning via Rational Arithmetic

[2601.03213] Critic-Guided Reinforcement Unlearning in Text-to-Image Diffusion

[2512.20885] From GNNs to Symbolic Surrogates via Kolmogorov-Arnold Networks for Delay Prediction

[2511.17879] Generative Adversarial Post-Training Mitigates Reward Hacking in Live Human-AI Music Interaction

[2511.16652] Evolution Strategies at the Hyperscale

[2408.10746] Resource-Efficient Personal Large Language Models Fine-Tuning with Collaborative Edge Computing

[2510.15987] Algorithmic Primitives and Compositional Geometry of Reasoning in Language Models

[2510.13654] Challenges and Requirements for Benchmarking Time Series Foundation Models

[2406.04955] Experimental Evaluation of ROS-Causal in Real-World Human-Robot Spatial Interaction Scenarios

[2510.07182] Bridged Clustering: Semi-Supervised Sparse Bridging

[2404.08634] When Attention Collapses: How Degenerate Layers in LLMs Enable Smaller, Stronger Models

Related Topics

Stay updated with AI News