AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Ai Infrastructure

[P] GPU friendly lossless 12-bit BF16 format with 0.03% escape rate and 1 integer ADD decode works for AMD & NVIDIA

Hi everyone : ) I just released a new research prototype It’s a lossless BF16 compression format that stores weights in 12 bits by replac...

Reddit - Machine Learning · 1 min ·
OpenAI’s Fidji Simo Is Taking Medical Leave Amid an Executive Shake-Up | WIRED
Ai Infrastructure

OpenAI’s Fidji Simo Is Taking Medical Leave Amid an Executive Shake-Up | WIRED

The company is undergoing major leadership restructuring as its CEO of AGI deployment goes on leave for “several weeks.”

Wired - AI · 5 min ·

All Content

[2602.23653] ProtoDCS: Towards Robust and Efficient Open-Set Test-Time Adaptation for Vision-Language Models
Llms

[2602.23653] ProtoDCS: Towards Robust and Efficient Open-Set Test-Time Adaptation for Vision-Language Models

Abstract page for arXiv paper 2602.23653: ProtoDCS: Towards Robust and Efficient Open-Set Test-Time Adaptation for Vision-Language Models

arXiv - AI · 4 min ·
[2602.23636] FlexGuard: Continuous Risk Scoring for Strictness-Adaptive LLM Content Moderation
Llms

[2602.23636] FlexGuard: Continuous Risk Scoring for Strictness-Adaptive LLM Content Moderation

Abstract page for arXiv paper 2602.23636: FlexGuard: Continuous Risk Scoring for Strictness-Adaptive LLM Content Moderation

arXiv - Machine Learning · 4 min ·
[2602.23528] Neural Operators Can Discover Functional Clusters
Machine Learning

[2602.23528] Neural Operators Can Discover Functional Clusters

Abstract page for arXiv paper 2602.23528: Neural Operators Can Discover Functional Clusters

arXiv - Machine Learning · 4 min ·
[2602.23574] Evidential Neural Radiance Fields
Machine Learning

[2602.23574] Evidential Neural Radiance Fields

Abstract page for arXiv paper 2602.23574: Evidential Neural Radiance Fields

arXiv - AI · 3 min ·
[2602.23546] Humans and LLMs Diverge on Probabilistic Inferences
Llms

[2602.23546] Humans and LLMs Diverge on Probabilistic Inferences

Abstract page for arXiv paper 2602.23546: Humans and LLMs Diverge on Probabilistic Inferences

arXiv - AI · 3 min ·
[2602.23509] SegReg: Latent Space Regularization for Improved Medical Image Segmentation
Machine Learning

[2602.23509] SegReg: Latent Space Regularization for Improved Medical Image Segmentation

Abstract page for arXiv paper 2602.23509: SegReg: Latent Space Regularization for Improved Medical Image Segmentation

arXiv - AI · 3 min ·
[2602.23391] Detoxifying LLMs via Representation Erasure-Based Preference Optimization
Llms

[2602.23391] Detoxifying LLMs via Representation Erasure-Based Preference Optimization

Abstract page for arXiv paper 2602.23391: Detoxifying LLMs via Representation Erasure-Based Preference Optimization

arXiv - Machine Learning · 3 min ·
[2602.23372] Democratizing GraphRAG: Linear, CPU-Only Graph Retrieval for Multi-Hop QA
Llms

[2602.23372] Democratizing GraphRAG: Linear, CPU-Only Graph Retrieval for Multi-Hop QA

Abstract page for arXiv paper 2602.23372: Democratizing GraphRAG: Linear, CPU-Only Graph Retrieval for Multi-Hop QA

arXiv - AI · 3 min ·
[2602.23370] Toward General Semantic Chunking: A Discriminative Framework for Ultra-Long Documents
Llms

[2602.23370] Toward General Semantic Chunking: A Discriminative Framework for Ultra-Long Documents

Abstract page for arXiv paper 2602.23370: Toward General Semantic Chunking: A Discriminative Framework for Ultra-Long Documents

arXiv - AI · 4 min ·
[2602.24195] Uncertainty Quantification for Multimodal Large Language Models with Incoherence-adjusted Semantic Volume
Llms

[2602.24195] Uncertainty Quantification for Multimodal Large Language Models with Incoherence-adjusted Semantic Volume

Abstract page for arXiv paper 2602.24195: Uncertainty Quantification for Multimodal Large Language Models with Incoherence-adjusted Seman...

arXiv - Machine Learning · 4 min ·
[2602.24055] CIRCLE: A Framework for Evaluating AI from a Real-World Lens
Machine Learning

[2602.24055] CIRCLE: A Framework for Evaluating AI from a Real-World Lens

Abstract page for arXiv paper 2602.24055: CIRCLE: A Framework for Evaluating AI from a Real-World Lens

arXiv - AI · 4 min ·
[2602.23720] The Auton Agentic AI Framework
Llms

[2602.23720] The Auton Agentic AI Framework

Abstract page for arXiv paper 2602.23720: The Auton Agentic AI Framework

arXiv - AI · 4 min ·
[2602.23681] ODAR: Principled Adaptive Routing for LLM Reasoning via Active Inference
Llms

[2602.23681] ODAR: Principled Adaptive Routing for LLM Reasoning via Active Inference

Abstract page for arXiv paper 2602.23681: ODAR: Principled Adaptive Routing for LLM Reasoning via Active Inference

arXiv - AI · 4 min ·
Machine Learning

[D] Geospatial ML for humanitarian drought/flood forecasting: critique my approach / ideas for predictive urgency index

I'm working on a non-commercial geospatial ML project (AidMap AI) focused on Central Asia/Afghanistan/Syria – predicting "urgency levels"...

Reddit - Machine Learning · 1 min ·
Machine Learning

[P] easy-torch-tpu: Making it easy to train PyTorch-based models on Google TPUs

I've been working with Google TPU clusters for a few months now, and using PyTorch/XLA to train PyTorch-based models on them has frankly ...

Reddit - Machine Learning · 1 min ·
Ai Infrastructure

OpenAI eyes global domination with $110B Amazon and NVIDIA raise, value hits $840B

submitted by /u/sksarkpoes3 [link] [comments]

Reddit - Artificial Intelligence · 1 min ·
Llms

[R] Benchmarked 94 LLM endpoints for jan 2026. open source is now within 5 quality points of proprietary

been doing a deep dive on model selection for production inference and pulled togethar some numbers from whatllm.org's january 2026 repor...

Reddit - Machine Learning · 1 min ·
Machine Learning

DeepSeek optimizing for Chinese chips

Deepseek is about to drop V4, and the real story isn’t the model. It’s that they’ve optimized it to run on Huawei and Cambricon chips ins...

Reddit - Artificial Intelligence · 1 min ·
Llms

"You are humanity personified in 2076"

A continuation of the first time I did this with a narrative of humanity since the dawn of civilization. Really starting to get into thes...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

[D] got tired of "just vibes" testing for edge ML models, so I built automated quality gates

so about 6 months ago I was messing around with a vision model on a Snapdragon device as a side project. worked great on my laptop. deplo...

Reddit - Machine Learning · 1 min ·
Previous Page 66 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime