AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Ai Infrastructure

[P] GPU friendly lossless 12-bit BF16 format with 0.03% escape rate and 1 integer ADD decode works for AMD & NVIDIA

Hi everyone : ) I just released a new research prototype It’s a lossless BF16 compression format that stores weights in 12 bits by replac...

Reddit - Machine Learning · 1 min ·
OpenAI’s Fidji Simo Is Taking Medical Leave Amid an Executive Shake-Up | WIRED
Ai Infrastructure

OpenAI’s Fidji Simo Is Taking Medical Leave Amid an Executive Shake-Up | WIRED

The company is undergoing major leadership restructuring as its CEO of AGI deployment goes on leave for “several weeks.”

Wired - AI · 5 min ·

All Content

[2602.01776] Position: Beyond Model-Centric Prediction -- Agentic Time Series Forecasting
Machine Learning

[2602.01776] Position: Beyond Model-Centric Prediction -- Agentic Time Series Forecasting

Abstract page for arXiv paper 2602.01776: Position: Beyond Model-Centric Prediction -- Agentic Time Series Forecasting

arXiv - Machine Learning · 4 min ·
[2602.07319] Beyond Accuracy: Risk-Sensitive Evaluation of Hallucinated Medical Advice
Llms

[2602.07319] Beyond Accuracy: Risk-Sensitive Evaluation of Hallucinated Medical Advice

Abstract page for arXiv paper 2602.07319: Beyond Accuracy: Risk-Sensitive Evaluation of Hallucinated Medical Advice

arXiv - AI · 3 min ·
[2512.13352] On the Effectiveness of Membership Inference in Targeted Data Extraction from Large Language Models
Llms

[2512.13352] On the Effectiveness of Membership Inference in Targeted Data Extraction from Large Language Models

Abstract page for arXiv paper 2512.13352: On the Effectiveness of Membership Inference in Targeted Data Extraction from Large Language Mo...

arXiv - Machine Learning · 3 min ·
[2510.17268] Uncertainty-aware data assimilation through variational inference
Machine Learning

[2510.17268] Uncertainty-aware data assimilation through variational inference

Abstract page for arXiv paper 2510.17268: Uncertainty-aware data assimilation through variational inference

arXiv - Machine Learning · 3 min ·
[2509.24762] In-Context Learning of Temporal Point Processes with Foundation Inference Models
Machine Learning

[2509.24762] In-Context Learning of Temporal Point Processes with Foundation Inference Models

Abstract page for arXiv paper 2509.24762: In-Context Learning of Temporal Point Processes with Foundation Inference Models

arXiv - Machine Learning · 3 min ·
[2511.17812] Score-Regularized Joint Sampling with Importance Weights for Flow Matching
Machine Learning

[2511.17812] Score-Regularized Joint Sampling with Importance Weights for Flow Matching

Abstract page for arXiv paper 2511.17812: Score-Regularized Joint Sampling with Importance Weights for Flow Matching

arXiv - Machine Learning · 4 min ·
[2511.15927] DiffuMamba: High-Throughput Diffusion LMs with Mamba Backbone
Llms

[2511.15927] DiffuMamba: High-Throughput Diffusion LMs with Mamba Backbone

Abstract page for arXiv paper 2511.15927: DiffuMamba: High-Throughput Diffusion LMs with Mamba Backbone

arXiv - Machine Learning · 3 min ·
[2508.05190] Physics-Informed Time-Integrated DeepONet: Temporal Tangent Space Operator Learning for High-Accuracy Inference
Machine Learning

[2508.05190] Physics-Informed Time-Integrated DeepONet: Temporal Tangent Space Operator Learning for High-Accuracy Inference

Abstract page for arXiv paper 2508.05190: Physics-Informed Time-Integrated DeepONet: Temporal Tangent Space Operator Learning for High-Ac...

arXiv - Machine Learning · 4 min ·
[2510.06646] The False Promise of Zero-Shot Super-Resolution in Machine-Learned Operators
Machine Learning

[2510.06646] The False Promise of Zero-Shot Super-Resolution in Machine-Learned Operators

Abstract page for arXiv paper 2510.06646: The False Promise of Zero-Shot Super-Resolution in Machine-Learned Operators

arXiv - Machine Learning · 4 min ·
[2510.05535] Permutation-Invariant Representation Learning for Robust and Privacy-Preserving Feature Selection
Nlp

[2510.05535] Permutation-Invariant Representation Learning for Robust and Privacy-Preserving Feature Selection

Abstract page for arXiv paper 2510.05535: Permutation-Invariant Representation Learning for Robust and Privacy-Preserving Feature Selection

arXiv - Machine Learning · 4 min ·
[2503.12354] Probabilistic Neural Networks (PNNs) with t-Distributed Outputs: Adaptive Prediction Intervals Beyond Gaussian Assumptions
Machine Learning

[2503.12354] Probabilistic Neural Networks (PNNs) with t-Distributed Outputs: Adaptive Prediction Intervals Beyond Gaussian Assumptions

Abstract page for arXiv paper 2503.12354: Probabilistic Neural Networks (PNNs) with t-Distributed Outputs: Adaptive Prediction Intervals ...

arXiv - Machine Learning · 4 min ·
[2412.15176] Rethinking Uncertainty Estimation in LLMs: A Principled Single-Sequence Measure
Llms

[2412.15176] Rethinking Uncertainty Estimation in LLMs: A Principled Single-Sequence Measure

Abstract page for arXiv paper 2412.15176: Rethinking Uncertainty Estimation in LLMs: A Principled Single-Sequence Measure

arXiv - Machine Learning · 3 min ·
[2508.21048] Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning
Machine Learning

[2508.21048] Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning

Abstract page for arXiv paper 2508.21048: Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning

arXiv - AI · 4 min ·
[2508.18395] Latent Self-Consistency for Reliable Majority-Set Selection in Short- and Long-Answer Reasoning
Llms

[2508.18395] Latent Self-Consistency for Reliable Majority-Set Selection in Short- and Long-Answer Reasoning

Abstract page for arXiv paper 2508.18395: Latent Self-Consistency for Reliable Majority-Set Selection in Short- and Long-Answer Reasoning

arXiv - AI · 4 min ·
[2508.16242] A Reduction of Input/Output Logics to SAT
Ai Infrastructure

[2508.16242] A Reduction of Input/Output Logics to SAT

Abstract page for arXiv paper 2508.16242: A Reduction of Input/Output Logics to SAT

arXiv - AI · 3 min ·
[2508.09904] Beyond Naïve Prompting: Strategies for Improved Context-aided Forecasting with LLMs
Llms

[2508.09904] Beyond Naïve Prompting: Strategies for Improved Context-aided Forecasting with LLMs

Abstract page for arXiv paper 2508.09904: Beyond Naïve Prompting: Strategies for Improved Context-aided Forecasting with LLMs

arXiv - Machine Learning · 4 min ·
[2602.24208] SenCache: Accelerating Diffusion Model Inference via Sensitivity-Aware Caching
Machine Learning

[2602.24208] SenCache: Accelerating Diffusion Model Inference via Sensitivity-Aware Caching

Abstract page for arXiv paper 2602.24208: SenCache: Accelerating Diffusion Model Inference via Sensitivity-Aware Caching

arXiv - Machine Learning · 3 min ·
[2507.02965] Concept-based Adversarial Attack: a Probabilistic Perspective
Ai Infrastructure

[2507.02965] Concept-based Adversarial Attack: a Probabilistic Perspective

Abstract page for arXiv paper 2507.02965: Concept-based Adversarial Attack: a Probabilistic Perspective

arXiv - AI · 3 min ·
[2602.24086] The Subjectivity of Monoculture
Llms

[2602.24086] The Subjectivity of Monoculture

Abstract page for arXiv paper 2602.24086: The Subjectivity of Monoculture

arXiv - Machine Learning · 3 min ·
[2602.24007] Inference-time optimization for experiment-grounded protein ensemble generation
Machine Learning

[2602.24007] Inference-time optimization for experiment-grounded protein ensemble generation

Abstract page for arXiv paper 2602.24007: Inference-time optimization for experiment-grounded protein ensemble generation

arXiv - Machine Learning · 4 min ·
Previous Page 64 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime