AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

AI Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 3 hours ago

AI Infrastructure

[P] GPU friendly lossless 12-bit BF16 format with 0.03% escape rate and 1 integer ADD decode works for AMD & NVIDIA

Hi everyone :) I just released a new research prototype. It’s a lossless BF16 compression format that stores weights in 12 bits by replac...

Reddit - Machine Learning · 1 min · about 4 hours ago

AI Infrastructure

OpenAI’s Fidji Simo Is Taking Medical Leave Amid an Executive Shake-Up | WIRED

The company is undergoing major leadership restructuring as its CEO of AGI deployment goes on leave for “several weeks.”

Wired - AI · 5 min · about 8 hours ago

All Content

LLMs

[2602.23653] ProtoDCS: Towards Robust and Efficient Open-Set Test-Time Adaptation for Vision-Language Models

Abstract page for arXiv paper 2602.23653: ProtoDCS: Towards Robust and Efficient Open-Set Test-Time Adaptation for Vision-Language Models

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.23636] FlexGuard: Continuous Risk Scoring for Strictness-Adaptive LLM Content Moderation

Abstract page for arXiv paper 2602.23636: FlexGuard: Continuous Risk Scoring for Strictness-Adaptive LLM Content Moderation

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.23528] Neural Operators Can Discover Functional Clusters

Abstract page for arXiv paper 2602.23528: Neural Operators Can Discover Functional Clusters

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.23574] Evidential Neural Radiance Fields

Abstract page for arXiv paper 2602.23574: Evidential Neural Radiance Fields

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.23546] Humans and LLMs Diverge on Probabilistic Inferences

Abstract page for arXiv paper 2602.23546: Humans and LLMs Diverge on Probabilistic Inferences

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2602.23509] SegReg: Latent Space Regularization for Improved Medical Image Segmentation

Abstract page for arXiv paper 2602.23509: SegReg: Latent Space Regularization for Improved Medical Image Segmentation

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.23391] Detoxifying LLMs via Representation Erasure-Based Preference Optimization

Abstract page for arXiv paper 2602.23391: Detoxifying LLMs via Representation Erasure-Based Preference Optimization

arXiv - Machine Learning · 3 min · about 1 month ago

LLMs

[2602.23372] Democratizing GraphRAG: Linear, CPU-Only Graph Retrieval for Multi-Hop QA

Abstract page for arXiv paper 2602.23372: Democratizing GraphRAG: Linear, CPU-Only Graph Retrieval for Multi-Hop QA

arXiv - AI · 3 min · about 1 month ago

LLMs

[2602.23370] Toward General Semantic Chunking: A Discriminative Framework for Ultra-Long Documents

Abstract page for arXiv paper 2602.23370: Toward General Semantic Chunking: A Discriminative Framework for Ultra-Long Documents

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.24195] Uncertainty Quantification for Multimodal Large Language Models with Incoherence-adjusted Semantic Volume

Abstract page for arXiv paper 2602.24195: Uncertainty Quantification for Multimodal Large Language Models with Incoherence-adjusted Seman...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.24055] CIRCLE: A Framework for Evaluating AI from a Real-World Lens

Abstract page for arXiv paper 2602.24055: CIRCLE: A Framework for Evaluating AI from a Real-World Lens

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.23720] The Auton Agentic AI Framework

Abstract page for arXiv paper 2602.23720: The Auton Agentic AI Framework

arXiv - AI · 4 min · about 1 month ago

LLMs

[2602.23681] ODAR: Principled Adaptive Routing for LLM Reasoning via Active Inference

Abstract page for arXiv paper 2602.23681: ODAR: Principled Adaptive Routing for LLM Reasoning via Active Inference

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[D] Geospatial ML for humanitarian drought/flood forecasting: critique my approach / ideas for predictive urgency index

I'm working on a non-commercial geospatial ML project (AidMap AI) focused on Central Asia/Afghanistan/Syria – predicting "urgency levels"...

Reddit - Machine Learning · 1 min · about 1 month ago

Machine Learning

[P] easy-torch-tpu: Making it easy to train PyTorch-based models on Google TPUs

I've been working with Google TPU clusters for a few months now, and using PyTorch/XLA to train PyTorch-based models on them has frankly ...

Reddit - Machine Learning · 1 min · about 1 month ago

AI Infrastructure

OpenAI eyes global domination with $110B Amazon and NVIDIA raise, value hits $840B

Reddit - Artificial Intelligence · 1 min · about 1 month ago

LLMs

[R] Benchmarked 94 LLM endpoints for jan 2026. open source is now within 5 quality points of proprietary

been doing a deep dive on model selection for production inference and pulled together some numbers from whatllm.org's january 2026 repor...

Reddit - Machine Learning · 1 min · about 1 month ago

Machine Learning

DeepSeek optimizing for Chinese chips

DeepSeek is about to drop V4, and the real story isn’t the model. It’s that they’ve optimized it to run on Huawei and Cambricon chips ins...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

LLMs

"You are humanity personified in 2076"

A continuation of the first time I did this with a narrative of humanity since the dawn of civilization. Really starting to get into thes...

Reddit - Artificial Intelligence · 1 min · about 1 month ago

Machine Learning

[D] got tired of "just vibes" testing for edge ML models, so I built automated quality gates

so about 6 months ago I was messing around with a vision model on a Snapdragon device as a side project. worked great on my laptop. deplo...

Reddit - Machine Learning · 1 min · about 1 month ago
