AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

AI Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 2 hours ago

Machine Learning

[2603.12372] Efficient Reasoning with Balanced Thinking

Abstract page for arXiv paper 2603.12372: Efficient Reasoning with Balanced Thinking

arXiv - Machine Learning · 4 min · about 3 hours ago

Machine Learning

[2510.13714] DeDelayed: Deleting Remote Inference Delay via On-Device Correction

Abstract page for arXiv paper 2510.13714: DeDelayed: Deleting Remote Inference Delay via On-Device Correction

arXiv - Machine Learning · 4 min · about 3 hours ago

All Content

Machine Learning

[P] We made GoodSeed, a pleasant ML experiment tracker

GoodSeed v0.3.0 🎉 My friend and I are pleased to announce GoodSeed - an ML experiment tracker which we are now using as a replacement for ...

Reddit - Machine Learning · 1 min · about 1 month ago

LLMs

[D] Predicting total cost of agentic LLM workflows - is there a research gap around output token count and chain depth estimation?

Working on a practical problem that I think has an interesting ML angle. In agentic LLM workflows (tool use, multi-step reasoning, ReAct-...

Reddit - Machine Learning · 1 min · about 1 month ago

LLMs

LLMs can unmask pseudonymous users at scale with surprising accuracy - Ars Technica

Pseudonymity has never been perfect for preserving privacy. Soon it may be pointless.

Ars Technica - AI · 7 min · about 1 month ago

Machine Learning

[P] On-device Qwen3-TTS (1.7B/0.6B) inference on iOS and macOS via MLX-Swift — voice cloning, voice design, and streaming TTS with no cloud

Hey r/MachineLearning. I'm a solo dev working on on-device TTS using MLX-Swift with Qwen3-TTS. 1.7B model on macOS, 0.6B on iOS, quantize...

Reddit - Machine Learning · 1 min · about 1 month ago

Machine Learning

[R] To the Women of Machine Learning - I'm Hiring!

It's not a secret that ML Engineers are predominantly men. Still, as I work to build a foundational ML team, I am being intentional about...

Reddit - Machine Learning · 1 min · about 1 month ago

Machine Learning

[2511.01266] MotionStream: Real-Time Video Generation with Interactive Motion Controls

Abstract page for arXiv paper 2511.01266: MotionStream: Real-Time Video Generation with Interactive Motion Controls

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2510.13849] Language steering in latent space to mitigate unintended code-switching

Abstract page for arXiv paper 2510.13849: Language steering in latent space to mitigate unintended code-switching

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2509.22459] Universal Inverse Distillation for Matching Models with Real-Data Supervision (No GANs)

Abstract page for arXiv paper 2509.22459: Universal Inverse Distillation for Matching Models with Real-Data Supervision (No GANs)

arXiv - Machine Learning · 4 min · about 1 month ago

NLP

[2509.21764] CubistMerge: Spatial-Preserving Token Merging For Diverse ViT Backbones

Abstract page for arXiv paper 2509.21764: CubistMerge: Spatial-Preserving Token Merging For Diverse ViT Backbones

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2509.10756] Quantum parameter estimation with uncertainty quantification from continuous measurement data using neural network ensembles

Abstract page for arXiv paper 2509.10756: Quantum parameter estimation with uncertainty quantification from continuous measurement data u...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2509.01799] Optimal information injection and transfer mechanisms for active matter reservoir computing

Abstract page for arXiv paper 2509.01799: Optimal information injection and transfer mechanisms for active matter reservoir computing

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2507.16001] Separating Ansatz Discovery from Deployment on Larger Problems: Reinforcement Learning for Modular Circuit Design

Abstract page for arXiv paper 2507.16001: Separating Ansatz Discovery from Deployment on Larger Problems: Reinforcement Learning for Modu...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2507.07469] A Projection-Based ARIMA Framework for Nonlinear Dynamics in Macroeconomic and Financial Time Series: Closed-Form Estimation and Rolling-Window Inference

Abstract page for arXiv paper 2507.07469: A Projection-Based ARIMA Framework for Nonlinear Dynamics in Macroeconomic and Financial Time S...

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2506.05639] FictionalQA: A Dataset for Studying Memorization and Knowledge Acquisition

Abstract page for arXiv paper 2506.05639: FictionalQA: A Dataset for Studying Memorization and Knowledge Acquisition

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2501.15849] Data-Driven Prediction and Control of Hammerstein-Wiener Systems with Implicit Gaussian Processes

Abstract page for arXiv paper 2501.15849: Data-Driven Prediction and Control of Hammerstein-Wiener Systems with Implicit Gaussian Processes

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2406.16227] VICatMix: variational Bayesian clustering and variable selection for discrete biomedical data

Abstract page for arXiv paper 2406.16227: VICatMix: variational Bayesian clustering and variable selection for discrete biomedical data

arXiv - Machine Learning · 4 min · about 1 month ago

AI Infrastructure

[2602.04083] Structure-Informed Estimation for Pilot-Limited MIMO Channels via Tensor Decomposition

Abstract page for arXiv paper 2602.04083: Structure-Informed Estimation for Pilot-Limited MIMO Channels via Tensor Decomposition

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.01649] Contribution-aware Token Compression for Efficient Video Understanding via Reinforcement Learning

Abstract page for arXiv paper 2602.01649: Contribution-aware Token Compression for Efficient Video Understanding via Reinforcement Learning

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.08324] Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression

Abstract page for arXiv paper 2602.08324: Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression

arXiv - Machine Learning · 4 min · about 1 month ago

LLMs

[2602.05735] CSRv2: Unlocking Ultra-Sparse Embeddings

Abstract page for arXiv paper 2602.05735: CSRv2: Unlocking Ultra-Sparse Embeddings

arXiv - Machine Learning · 4 min · about 1 month ago
