AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
AI Infrastructure

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
[2603.12372] Efficient Reasoning with Balanced Thinking
Machine Learning

Abstract page for arXiv paper 2603.12372: Efficient Reasoning with Balanced Thinking

arXiv - Machine Learning · 4 min ·
[2510.13714] DeDelayed: Deleting Remote Inference Delay via On-Device Correction
Machine Learning

Abstract page for arXiv paper 2510.13714: DeDelayed: Deleting Remote Inference Delay via On-Device Correction

arXiv - Machine Learning · 4 min ·

All Content

Machine Learning

[P] We made GoodSeed, a pleasant ML experiment tracker

GoodSeed v0.3.0 🎉 My friend and I are pleased to announce GoodSeed, an ML experiment tracker which we are now using as a replacement for ...

Reddit - Machine Learning · 1 min ·
LLMs

[D] Predicting total cost of agentic LLM workflows - is there a research gap around output token count and chain depth estimation?

Working on a practical problem that I think has an interesting ML angle. In agentic LLM workflows (tool use, multi-step reasoning, ReAct-...

Reddit - Machine Learning · 1 min ·
LLMs can unmask pseudonymous users at scale with surprising accuracy - Ars Technica
LLMs

Pseudonymity has never been perfect for preserving privacy. Soon it may be pointless.

Ars Technica - AI · 7 min ·
Machine Learning

[P] On-device Qwen3-TTS (1.7B/0.6B) inference on iOS and macOS via MLX-Swift — voice cloning, voice design, and streaming TTS with no cloud

Hey r/MachineLearning. I'm a solo dev working on on-device TTS using MLX-Swift with Qwen3-TTS. 1.7B model on macOS, 0.6B on iOS, quantize...

Reddit - Machine Learning · 1 min ·
Machine Learning

[R] To the Women of Machine Learning - I'm Hiring!

It's not a secret that ML Engineers are predominantly men. Still, as I work to build a foundational ML team, I am being intentional about...

Reddit - Machine Learning · 1 min ·
[2511.01266] MotionStream: Real-Time Video Generation with Interactive Motion Controls
Machine Learning

Abstract page for arXiv paper 2511.01266: MotionStream: Real-Time Video Generation with Interactive Motion Controls

arXiv - Machine Learning · 4 min ·
[2510.13849] Language steering in latent space to mitigate unintended code-switching
LLMs

Abstract page for arXiv paper 2510.13849: Language steering in latent space to mitigate unintended code-switching

arXiv - Machine Learning · 3 min ·
[2509.22459] Universal Inverse Distillation for Matching Models with Real-Data Supervision (No GANs)
Machine Learning

Abstract page for arXiv paper 2509.22459: Universal Inverse Distillation for Matching Models with Real-Data Supervision (No GANs)

arXiv - Machine Learning · 4 min ·
[2509.21764] CubistMerge: Spatial-Preserving Token Merging For Diverse ViT Backbones
NLP

Abstract page for arXiv paper 2509.21764: CubistMerge: Spatial-Preserving Token Merging For Diverse ViT Backbones

arXiv - Machine Learning · 4 min ·
[2509.10756] Quantum parameter estimation with uncertainty quantification from continuous measurement data using neural network ensembles
Machine Learning

Abstract page for arXiv paper 2509.10756: Quantum parameter estimation with uncertainty quantification from continuous measurement data u...

arXiv - Machine Learning · 3 min ·
[2509.01799] Optimal information injection and transfer mechanisms for active matter reservoir computing
Machine Learning

Abstract page for arXiv paper 2509.01799: Optimal information injection and transfer mechanisms for active matter reservoir computing

arXiv - Machine Learning · 4 min ·
[2507.16001] Separating Ansatz Discovery from Deployment on Larger Problems: Reinforcement Learning for Modular Circuit Design
Machine Learning

Abstract page for arXiv paper 2507.16001: Separating Ansatz Discovery from Deployment on Larger Problems: Reinforcement Learning for Modu...

arXiv - Machine Learning · 4 min ·
[2507.07469] A Projection-Based ARIMA Framework for Nonlinear Dynamics in Macroeconomic and Financial Time Series: Closed-Form Estimation and Rolling-Window Inference
Machine Learning

Abstract page for arXiv paper 2507.07469: A Projection-Based ARIMA Framework for Nonlinear Dynamics in Macroeconomic and Financial Time S...

arXiv - Machine Learning · 4 min ·
[2506.05639] FictionalQA: A Dataset for Studying Memorization and Knowledge Acquisition
LLMs

Abstract page for arXiv paper 2506.05639: FictionalQA: A Dataset for Studying Memorization and Knowledge Acquisition

arXiv - Machine Learning · 3 min ·
[2501.15849] Data-Driven Prediction and Control of Hammerstein-Wiener Systems with Implicit Gaussian Processes
Machine Learning

Abstract page for arXiv paper 2501.15849: Data-Driven Prediction and Control of Hammerstein-Wiener Systems with Implicit Gaussian Processes

arXiv - Machine Learning · 4 min ·
[2406.16227] VICatMix: variational Bayesian clustering and variable selection for discrete biomedical data
Machine Learning

Abstract page for arXiv paper 2406.16227: VICatMix: variational Bayesian clustering and variable selection for discrete biomedical data

arXiv - Machine Learning · 4 min ·
[2602.04083] Structure-Informed Estimation for Pilot-Limited MIMO Channels via Tensor Decomposition
AI Infrastructure

Abstract page for arXiv paper 2602.04083: Structure-Informed Estimation for Pilot-Limited MIMO Channels via Tensor Decomposition

arXiv - AI · 4 min ·
[2602.01649] Contribution-aware Token Compression for Efficient Video Understanding via Reinforcement Learning
LLMs

Abstract page for arXiv paper 2602.01649: Contribution-aware Token Compression for Efficient Video Understanding via Reinforcement Learning

arXiv - AI · 4 min ·
[2602.08324] Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression
LLMs

Abstract page for arXiv paper 2602.08324: Towards Efficient Large Language Reasoning Models via Extreme-Ratio Chain-of-Thought Compression

arXiv - Machine Learning · 4 min ·
[2602.05735] CSRv2: Unlocking Ultra-Sparse Embeddings
LLMs

Abstract page for arXiv paper 2602.05735: CSRv2: Unlocking Ultra-Sparse Embeddings

arXiv - Machine Learning · 4 min ·