AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 1 hour ago

Ai Infrastructure

[P] GPU friendly lossless 12-bit BF16 format with 0.03% escape rate and 1 integer ADD decode works for AMD & NVIDIA

Hi everyone : ) I just released a new research prototype It’s a lossless BF16 compression format that stores weights in 12 bits by replac...

Reddit - Machine Learning · 1 min · about 1 hour ago

Ai Infrastructure

OpenAI’s Fidji Simo Is Taking Medical Leave Amid an Executive Shake-Up | WIRED

The company is undergoing major leadership restructuring as its CEO of AGI deployment goes on leave for “several weeks.”

Wired - AI · 5 min · about 5 hours ago

All Content

Machine Learning

[2602.01776] Position: Beyond Model-Centric Prediction -- Agentic Time Series Forecasting

Abstract page for arXiv paper 2602.01776: Position: Beyond Model-Centric Prediction -- Agentic Time Series Forecasting

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2602.07319] Beyond Accuracy: Risk-Sensitive Evaluation of Hallucinated Medical Advice

Abstract page for arXiv paper 2602.07319: Beyond Accuracy: Risk-Sensitive Evaluation of Hallucinated Medical Advice

arXiv - AI · 3 min · about 1 month ago

Llms

[2512.13352] On the Effectiveness of Membership Inference in Targeted Data Extraction from Large Language Models

Abstract page for arXiv paper 2512.13352: On the Effectiveness of Membership Inference in Targeted Data Extraction from Large Language Mo...

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2510.17268] Uncertainty-aware data assimilation through variational inference

Abstract page for arXiv paper 2510.17268: Uncertainty-aware data assimilation through variational inference

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2509.24762] In-Context Learning of Temporal Point Processes with Foundation Inference Models

Abstract page for arXiv paper 2509.24762: In-Context Learning of Temporal Point Processes with Foundation Inference Models

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2511.17812] Score-Regularized Joint Sampling with Importance Weights for Flow Matching

Abstract page for arXiv paper 2511.17812: Score-Regularized Joint Sampling with Importance Weights for Flow Matching

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2511.15927] DiffuMamba: High-Throughput Diffusion LMs with Mamba Backbone

Abstract page for arXiv paper 2511.15927: DiffuMamba: High-Throughput Diffusion LMs with Mamba Backbone

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2508.05190] Physics-Informed Time-Integrated DeepONet: Temporal Tangent Space Operator Learning for High-Accuracy Inference

Abstract page for arXiv paper 2508.05190: Physics-Informed Time-Integrated DeepONet: Temporal Tangent Space Operator Learning for High-Ac...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2510.06646] The False Promise of Zero-Shot Super-Resolution in Machine-Learned Operators

Abstract page for arXiv paper 2510.06646: The False Promise of Zero-Shot Super-Resolution in Machine-Learned Operators

arXiv - Machine Learning · 4 min · about 1 month ago

Nlp

[2510.05535] Permutation-Invariant Representation Learning for Robust and Privacy-Preserving Feature Selection

Abstract page for arXiv paper 2510.05535: Permutation-Invariant Representation Learning for Robust and Privacy-Preserving Feature Selection

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2503.12354] Probabilistic Neural Networks (PNNs) with t-Distributed Outputs: Adaptive Prediction Intervals Beyond Gaussian Assumptions

Abstract page for arXiv paper 2503.12354: Probabilistic Neural Networks (PNNs) with t-Distributed Outputs: Adaptive Prediction Intervals ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2412.15176] Rethinking Uncertainty Estimation in LLMs: A Principled Single-Sequence Measure

Abstract page for arXiv paper 2412.15176: Rethinking Uncertainty Estimation in LLMs: A Principled Single-Sequence Measure

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2508.21048] Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning

Abstract page for arXiv paper 2508.21048: Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning

arXiv - AI · 4 min · about 1 month ago

Llms

[2508.18395] Latent Self-Consistency for Reliable Majority-Set Selection in Short- and Long-Answer Reasoning

Abstract page for arXiv paper 2508.18395: Latent Self-Consistency for Reliable Majority-Set Selection in Short- and Long-Answer Reasoning

arXiv - AI · 4 min · about 1 month ago

Ai Infrastructure

[2508.16242] A Reduction of Input/Output Logics to SAT

Abstract page for arXiv paper 2508.16242: A Reduction of Input/Output Logics to SAT

arXiv - AI · 3 min · about 1 month ago

Llms

[2508.09904] Beyond Naïve Prompting: Strategies for Improved Context-aided Forecasting with LLMs

Abstract page for arXiv paper 2508.09904: Beyond Naïve Prompting: Strategies for Improved Context-aided Forecasting with LLMs

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.24208] SenCache: Accelerating Diffusion Model Inference via Sensitivity-Aware Caching

Abstract page for arXiv paper 2602.24208: SenCache: Accelerating Diffusion Model Inference via Sensitivity-Aware Caching

arXiv - Machine Learning · 3 min · about 1 month ago

Ai Infrastructure

[2507.02965] Concept-based Adversarial Attack: a Probabilistic Perspective

Abstract page for arXiv paper 2507.02965: Concept-based Adversarial Attack: a Probabilistic Perspective

arXiv - AI · 3 min · about 1 month ago

Llms

[2602.24086] The Subjectivity of Monoculture

Abstract page for arXiv paper 2602.24086: The Subjectivity of Monoculture

arXiv - Machine Learning · 3 min · about 1 month ago

Machine Learning

[2602.24007] Inference-time optimization for experiment-grounded protein ensemble generation

Abstract page for arXiv paper 2602.24007: Inference-time optimization for experiment-grounded protein ensemble generation

arXiv - Machine Learning · 4 min · about 1 month ago

Previous Page 64 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence

[P] GPU friendly lossless 12-bit BF16 format with 0.03% escape rate and 1 integer ADD decode works for AMD & NVIDIA

OpenAI’s Fidji Simo Is Taking Medical Leave Amid an Executive Shake-Up | WIRED

All Content

[2602.01776] Position: Beyond Model-Centric Prediction -- Agentic Time Series Forecasting

[2602.07319] Beyond Accuracy: Risk-Sensitive Evaluation of Hallucinated Medical Advice

[2512.13352] On the Effectiveness of Membership Inference in Targeted Data Extraction from Large Language Models

[2510.17268] Uncertainty-aware data assimilation through variational inference

[2509.24762] In-Context Learning of Temporal Point Processes with Foundation Inference Models

[2511.17812] Score-Regularized Joint Sampling with Importance Weights for Flow Matching

[2511.15927] DiffuMamba: High-Throughput Diffusion LMs with Mamba Backbone

[2508.05190] Physics-Informed Time-Integrated DeepONet: Temporal Tangent Space Operator Learning for High-Accuracy Inference

[2510.06646] The False Promise of Zero-Shot Super-Resolution in Machine-Learned Operators

[2510.05535] Permutation-Invariant Representation Learning for Robust and Privacy-Preserving Feature Selection

[2503.12354] Probabilistic Neural Networks (PNNs) with t-Distributed Outputs: Adaptive Prediction Intervals Beyond Gaussian Assumptions

[2412.15176] Rethinking Uncertainty Estimation in LLMs: A Principled Single-Sequence Measure

[2508.21048] Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning

[2508.18395] Latent Self-Consistency for Reliable Majority-Set Selection in Short- and Long-Answer Reasoning

[2508.16242] A Reduction of Input/Output Logics to SAT

[2508.09904] Beyond Naïve Prompting: Strategies for Improved Context-aided Forecasting with LLMs

[2602.24208] SenCache: Accelerating Diffusion Model Inference via Sensitivity-Aware Caching

[2507.02965] Concept-based Adversarial Attack: a Probabilistic Perspective

[2602.24086] The Subjectivity of Monoculture

[2602.24007] Inference-time optimization for experiment-grounded protein ensemble generation

Related Topics

Stay updated with AI News