AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Llms

LLM agents can trigger real actions now. But what actually stops them from executing?

We ran into a simple but important issue while building agents with tool calling: the model can propose actions but nothing actually enfo...

Reddit - Artificial Intelligence · 1 min ·
OpenAI, not yet public, raises $3B from retail investors in monster $122B fund raise | TechCrunch
Ai Infrastructure

OpenAI, not yet public, raises $3B from retail investors in monster $122B fund raise | TechCrunch

OpenAI's latest funding round, led by Amazon, Nvidia, and SoftBank, values the AI lab at $852 billion as it nears an IPO.

TechCrunch - AI · 4 min ·

All Content

[2504.14960] MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core
Machine Learning

[2504.14960] MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core

Abstract page for arXiv paper 2504.14960: MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Tr...

arXiv - Machine Learning · 4 min ·
[2505.21786] VeriTrail: Closed-Domain Hallucination Detection with Traceability
Llms

[2505.21786] VeriTrail: Closed-Domain Hallucination Detection with Traceability

Abstract page for arXiv paper 2505.21786: VeriTrail: Closed-Domain Hallucination Detection with Traceability

arXiv - AI · 3 min ·
[2505.16056] Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models
Llms

[2505.16056] Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models

Abstract page for arXiv paper 2505.16056: Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models

arXiv - Machine Learning · 4 min ·
[2505.17702] Seek-CAD: A Self-refined Generative Modeling for 3D Parametric CAD Using Local Inference via DeepSeek
Llms

[2505.17702] Seek-CAD: A Self-refined Generative Modeling for 3D Parametric CAD Using Local Inference via DeepSeek

Abstract page for arXiv paper 2505.17702: Seek-CAD: A Self-refined Generative Modeling for 3D Parametric CAD Using Local Inference via De...

arXiv - AI · 4 min ·
[2505.13109] FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference
Llms

[2505.13109] FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference

Abstract page for arXiv paper 2505.13109: FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference

arXiv - Machine Learning · 4 min ·
[2501.08044] UFGraphFR: Graph Federation Recommendation System based on User Text description features
Ai Infrastructure

[2501.08044] UFGraphFR: Graph Federation Recommendation System based on User Text description features

Abstract page for arXiv paper 2501.08044: UFGraphFR: Graph Federation Recommendation System based on User Text description features

arXiv - Machine Learning · 4 min ·
[2502.16411] Predictive AI Can Support Human Learning while Preserving Error Diversity
Machine Learning

[2502.16411] Predictive AI Can Support Human Learning while Preserving Error Diversity

Abstract page for arXiv paper 2502.16411: Predictive AI Can Support Human Learning while Preserving Error Diversity

arXiv - Machine Learning · 4 min ·
[2305.04979] FedHB: Hierarchical Bayesian Federated Learning
Machine Learning

[2305.04979] FedHB: Hierarchical Bayesian Federated Learning

Abstract page for arXiv paper 2305.04979: FedHB: Hierarchical Bayesian Federated Learning

arXiv - Machine Learning · 4 min ·
[2502.01247] Polynomial, trigonometric, and tropical activations
Machine Learning

[2502.01247] Polynomial, trigonometric, and tropical activations

Abstract page for arXiv paper 2502.01247: Polynomial, trigonometric, and tropical activations

arXiv - Machine Learning · 4 min ·
[2603.02194] From Leaderboard to Deployment: Code Quality Challenges in AV Perception Repositories
Machine Learning

[2603.02194] From Leaderboard to Deployment: Code Quality Challenges in AV Perception Repositories

Abstract page for arXiv paper 2603.02194: From Leaderboard to Deployment: Code Quality Challenges in AV Perception Repositories

arXiv - Machine Learning · 4 min ·
[2603.02159] Instrumental and Proximal Causal Inference with Gaussian Processes
Machine Learning

[2603.02159] Instrumental and Proximal Causal Inference with Gaussian Processes

Abstract page for arXiv paper 2603.02159: Instrumental and Proximal Causal Inference with Gaussian Processes

arXiv - Machine Learning · 3 min ·
[2410.13648] SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs
Llms

[2410.13648] SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs

Abstract page for arXiv paper 2410.13648: SimpleToM: Exposing the Gap between Explicit ToM Inference and Implicit ToM Application in LLMs

arXiv - AI · 4 min ·
[2603.02109] Orchestrating Multimodal DNN Workloads in Wireless Neural Processing
Machine Learning

[2603.02109] Orchestrating Multimodal DNN Workloads in Wireless Neural Processing

Abstract page for arXiv paper 2603.02109: Orchestrating Multimodal DNN Workloads in Wireless Neural Processing

arXiv - Machine Learning · 3 min ·
[2603.01999] Learning Vision-Based Omnidirectional Navigation: A Teacher-Student Approach Using Monocular Depth Estimation
Robotics

[2603.01999] Learning Vision-Based Omnidirectional Navigation: A Teacher-Student Approach Using Monocular Depth Estimation

Abstract page for arXiv paper 2603.01999: Learning Vision-Based Omnidirectional Navigation: A Teacher-Student Approach Using Monocular De...

arXiv - Machine Learning · 4 min ·
[2603.01971] LOCUS: A Distribution-Free Loss-Quantile Score for Risk-Aware Predictions
Machine Learning

[2603.01971] LOCUS: A Distribution-Free Loss-Quantile Score for Risk-Aware Predictions

Abstract page for arXiv paper 2603.01971: LOCUS: A Distribution-Free Loss-Quantile Score for Risk-Aware Predictions

arXiv - Machine Learning · 3 min ·
[2603.01870] Generalizing Logic-based Explanations for Machine Learning Classifiers via Optimization
Machine Learning

[2603.01870] Generalizing Logic-based Explanations for Machine Learning Classifiers via Optimization

Abstract page for arXiv paper 2603.01870: Generalizing Logic-based Explanations for Machine Learning Classifiers via Optimization

arXiv - Machine Learning · 4 min ·
[2603.01834] Probing Materials Knowledge in LLMs: From Latent Embeddings to Reliable Predictions
Llms

[2603.01834] Probing Materials Knowledge in LLMs: From Latent Embeddings to Reliable Predictions

Abstract page for arXiv paper 2603.01834: Probing Materials Knowledge in LLMs: From Latent Embeddings to Reliable Predictions

arXiv - Machine Learning · 3 min ·
[2602.10625] To Think or Not To Think, That is The Question for Large Reasoning Models in Theory of Mind Tasks
Llms

[2602.10625] To Think or Not To Think, That is The Question for Large Reasoning Models in Theory of Mind Tasks

Abstract page for arXiv paper 2602.10625: To Think or Not To Think, That is The Question for Large Reasoning Models in Theory of Mind Tasks

arXiv - AI · 4 min ·
[2601.10729] OrbitFlow: SLO-Aware Long-Context LLM Serving with Fine-Grained KV Cache Reconfiguration
Llms

[2601.10729] OrbitFlow: SLO-Aware Long-Context LLM Serving with Fine-Grained KV Cache Reconfiguration

Abstract page for arXiv paper 2601.10729: OrbitFlow: SLO-Aware Long-Context LLM Serving with Fine-Grained KV Cache Reconfiguration

arXiv - Machine Learning · 4 min ·
[2601.05724] Overcoming Joint Intractability with Lossless Hierarchical Speculative Decoding
Machine Learning

[2601.05724] Overcoming Joint Intractability with Lossless Hierarchical Speculative Decoding

Abstract page for arXiv paper 2601.05724: Overcoming Joint Intractability with Lossless Hierarchical Speculative Decoding

arXiv - AI · 3 min ·
Previous Page 49 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime