AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · 13 minutes ago

Llms

LLM agents can trigger real actions now. But what actually stops them from executing?

We ran into a simple but important issue while building agents with tool calling: the model can propose actions but nothing actually enfo...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Ai Infrastructure

OpenAI, not yet public, raises $3B from retail investors in monster $122B fund raise | TechCrunch

OpenAI's latest funding round, led by Amazon, Nvidia, and SoftBank, values the AI lab at $852 billion as it nears an IPO.

TechCrunch - AI · 4 min · about 5 hours ago

All Content

Machine Learning

[2504.14960] MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core

Abstract page for arXiv paper 2504.14960: MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Tr...

arXiv - Machine Learning · 4 min · 29 days ago

Llms

[2505.21786] VeriTrail: Closed-Domain Hallucination Detection with Traceability

Abstract page for arXiv paper 2505.21786: VeriTrail: Closed-Domain Hallucination Detection with Traceability

arXiv - AI · 3 min · 29 days ago

Llms

[2505.16056] Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models

Abstract page for arXiv paper 2505.16056: Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models

arXiv - Machine Learning · 4 min · 29 days ago

Llms

[2505.17702] Seek-CAD: A Self-refined Generative Modeling for 3D Parametric CAD Using Local Inference via DeepSeek

Abstract page for arXiv paper 2505.17702: Seek-CAD: A Self-refined Generative Modeling for 3D Parametric CAD Using Local Inference via De...

arXiv - AI · 4 min · 29 days ago

Llms

[2505.13109] FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference

Abstract page for arXiv paper 2505.13109: FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference

arXiv - Machine Learning · 4 min · 29 days ago

Ai Infrastructure

[2501.08044] UFGraphFR: Graph Federation Recommendation System based on User Text description features

Abstract page for arXiv paper 2501.08044: UFGraphFR: Graph Federation Recommendation System based on User Text description features

arXiv - Machine Learning · 4 min · 29 days ago

Machine Learning

[2502.16411] Predictive AI Can Support Human Learning while Preserving Error Diversity

Abstract page for arXiv paper 2502.16411: Predictive AI Can Support Human Learning while Preserving Error Diversity

arXiv - Machine Learning · 4 min · 29 days ago

Machine Learning

[2305.04979] FedHB: Hierarchical Bayesian Federated Learning

Abstract page for arXiv paper 2305.04979: FedHB: Hierarchical Bayesian Federated Learning

arXiv - Machine Learning · 4 min · 29 days ago

Machine Learning

[2502.01247] Polynomial, trigonometric, and tropical activations

Abstract page for arXiv paper 2502.01247: Polynomial, trigonometric, and tropical activations

arXiv - Machine Learning · 4 min · 29 days ago

Machine Learning

[2603.02194] From Leaderboard to Deployment: Code Quality Challenges in AV Perception Repositories

Abstract page for arXiv paper 2603.02194: From Leaderboard to Deployment: Code Quality Challenges in AV Perception Repositories

arXiv - Machine Learning · 4 min · 29 days ago

Machine Learning

[2603.02159] Instrumental and Proximal Causal Inference with Gaussian Processes

Abstract page for arXiv paper 2603.02159: Instrumental and Proximal Causal Inference with Gaussian Processes

arXiv - Machine Learning · 3 min · 29 days ago

Llms

[2410.13648] SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs

Abstract page for arXiv paper 2410.13648: SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs

arXiv - AI · 4 min · 29 days ago

Machine Learning

[2603.02109] Orchestrating Multimodal DNN Workloads in Wireless Neural Processing

Abstract page for arXiv paper 2603.02109: Orchestrating Multimodal DNN Workloads in Wireless Neural Processing

arXiv - Machine Learning · 3 min · 29 days ago

Robotics

[2603.01999] Learning Vision-Based Omnidirectional Navigation: A Teacher-Student Approach Using Monocular Depth Estimation

Abstract page for arXiv paper 2603.01999: Learning Vision-Based Omnidirectional Navigation: A Teacher-Student Approach Using Monocular De...

arXiv - Machine Learning · 4 min · 29 days ago

Machine Learning

[2603.01971] LOCUS: A Distribution-Free Loss-Quantile Score for Risk-Aware Predictions

Abstract page for arXiv paper 2603.01971: LOCUS: A Distribution-Free Loss-Quantile Score for Risk-Aware Predictions

arXiv - Machine Learning · 3 min · 29 days ago

Machine Learning

[2603.01870] Generalizing Logic-based Explanations for Machine Learning Classifiers via Optimization

Abstract page for arXiv paper 2603.01870: Generalizing Logic-based Explanations for Machine Learning Classifiers via Optimization

arXiv - Machine Learning · 4 min · 29 days ago

Llms

[2603.01834] Probing Materials Knowledge in LLMs: From Latent Embeddings to Reliable Predictions

Abstract page for arXiv paper 2603.01834: Probing Materials Knowledge in LLMs: From Latent Embeddings to Reliable Predictions

arXiv - Machine Learning · 3 min · 29 days ago

Llms

[2602.10625] To Think or Not To Think, That is The Question for Large Reasoning Models in Theory of Mind Tasks

Abstract page for arXiv paper 2602.10625: To Think or Not To Think, That is The Question for Large Reasoning Models in Theory of Mind Tasks

arXiv - AI · 4 min · 29 days ago

Llms

[2601.10729] OrbitFlow: SLO-Aware Long-Context LLM Serving with Fine-Grained KV Cache Reconfiguration

Abstract page for arXiv paper 2601.10729: OrbitFlow: SLO-Aware Long-Context LLM Serving with Fine-Grained KV Cache Reconfiguration

arXiv - Machine Learning · 4 min · 29 days ago

Machine Learning

[2601.05724] Overcoming Joint Intractability with Lossless Hierarchical Speculative Decoding

Abstract page for arXiv paper 2601.05724: Overcoming Joint Intractability with Lossless Hierarchical Speculative Decoding

arXiv - AI · 3 min · 29 days ago

Previous Page 49 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence

LLM agents can trigger real actions now. But what actually stops them from executing?

OpenAI, not yet public, raises $3B from retail investors in monster $122B fund raise | TechCrunch

All Content

[2504.14960] MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core

[2505.21786] VeriTrail: Closed-Domain Hallucination Detection with Traceability

[2505.16056] Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models

[2505.17702] Seek-CAD: A Self-refined Generative Modeling for 3D Parametric CAD Using Local Inference via DeepSeek

[2505.13109] FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference

[2501.08044] UFGraphFR: Graph Federation Recommendation System based on User Text description features

[2502.16411] Predictive AI Can Support Human Learning while Preserving Error Diversity

[2305.04979] FedHB: Hierarchical Bayesian Federated Learning

[2502.01247] Polynomial, trigonometric, and tropical activations

[2603.02194] From Leaderboard to Deployment: Code Quality Challenges in AV Perception Repositories

[2603.02159] Instrumental and Proximal Causal Inference with Gaussian Processes

[2410.13648] SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs

[2603.02109] Orchestrating Multimodal DNN Workloads in Wireless Neural Processing

[2603.01999] Learning Vision-Based Omnidirectional Navigation: A Teacher-Student Approach Using Monocular Depth Estimation

[2603.01971] LOCUS: A Distribution-Free Loss-Quantile Score for Risk-Aware Predictions

[2603.01870] Generalizing Logic-based Explanations for Machine Learning Classifiers via Optimization

[2603.01834] Probing Materials Knowledge in LLMs: From Latent Embeddings to Reliable Predictions

[2602.10625] To Think or Not To Think, That is The Question for Large Reasoning Models in Theory of Mind Tasks

[2601.10729] OrbitFlow: SLO-Aware Long-Context LLM Serving with Fine-Grained KV Cache Reconfiguration

[2601.05724] Overcoming Joint Intractability with Lossless Hierarchical Speculative Decoding

Related Topics

Stay updated with AI News