AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

Machine Learning

The AI Chip War is Just Getting Started

Everyone talks about AI models, but the real bottleneck might be hardware. According to a recent study by Roots Analysis: AI chip market ...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

AI Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 7 hours ago

LLMs

[2603.16430] EngGPT2: Sovereign, Efficient and Open Intelligence

Abstract page for arXiv paper 2603.16430: EngGPT2: Sovereign, Efficient and Open Intelligence

arXiv - AI · 4 min · about 10 hours ago

All Content

Machine Learning

[2510.08219] Post-hoc Stochastic Concept Bottleneck Models

Abstract page for arXiv paper 2510.08219: Post-hoc Stochastic Concept Bottleneck Models

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2509.23265] CREPE: Controlling Diffusion with Replica Exchange

Abstract page for arXiv paper 2509.23265: CREPE: Controlling Diffusion with Replica Exchange

arXiv - Machine Learning · 3 min · 27 days ago

LLMs

[2509.23202] Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization

Abstract page for arXiv paper 2509.23202: Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization

arXiv - Machine Learning · 4 min · 27 days ago

LLMs

[2503.01804] $\texttt{SEM-CTRL}$: Semantically Controlled Decoding

Abstract page for arXiv paper 2503.01804: $\texttt{SEM-CTRL}$: Semantically Controlled Decoding

arXiv - Machine Learning · 3 min · 27 days ago

LLMs

[2407.16893] The Price of Prompting: Profiling Energy Use in Large Language Models Inference

Abstract page for arXiv paper 2407.16893: The Price of Prompting: Profiling Energy Use in Large Language Models Inference

arXiv - AI · 4 min · 27 days ago

Machine Learning

[2506.05668] RNE: plug-and-play diffusion inference-time control and energy-based training

Abstract page for arXiv paper 2506.05668: RNE: plug-and-play diffusion inference-time control and energy-based training

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2505.18017] Strictly Constrained Generative Modeling via Split Augmented Langevin Sampling

Abstract page for arXiv paper 2505.18017: Strictly Constrained Generative Modeling via Split Augmented Langevin Sampling

arXiv - Machine Learning · 4 min · 27 days ago

LLMs

[2510.06410] Off-Trajectory Reasoning: Can LLMs Collaborate on Reasoning Trajectory?

Abstract page for arXiv paper 2510.06410: Off-Trajectory Reasoning: Can LLMs Collaborate on Reasoning Trajectory?

arXiv - AI · 4 min · 27 days ago

LLMs

[2505.19892] OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging

Abstract page for arXiv paper 2505.19892: OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging

arXiv - AI · 4 min · 27 days ago

Machine Learning

[2502.13731] Robust Counterfactual Inference in Markov Decision Processes

Abstract page for arXiv paper 2502.13731: Robust Counterfactual Inference in Markov Decision Processes

arXiv - AI · 3 min · 27 days ago

Machine Learning

[2603.03163] Conditioned Activation Transport for T2I Safety Steering

Abstract page for arXiv paper 2603.03163: Conditioned Activation Transport for T2I Safety Steering

arXiv - AI · 3 min · 27 days ago

NLP

[2603.03188] Scalable Uncertainty Quantification for Black-Box Density-Based Clustering

Abstract page for arXiv paper 2603.03188: Scalable Uncertainty Quantification for Black-Box Density-Based Clustering

arXiv - Machine Learning · 3 min · 27 days ago

Machine Learning

[2603.03146] Channel-Adaptive Edge AI: Maximizing Inference Throughput by Adapting Computational Complexity to Channel States

Abstract page for arXiv paper 2603.03146: Channel-Adaptive Edge AI: Maximizing Inference Throughput by Adapting Computational Complexity ...

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2603.03035] Generalized Bayes for Causal Inference

Abstract page for arXiv paper 2603.03035: Generalized Bayes for Causal Inference

arXiv - Machine Learning · 4 min · 27 days ago

Machine Learning

[2603.03101] MoECLIP: Patch-Specialized Experts for Zero-shot Anomaly Detection

Abstract page for arXiv paper 2603.03101: MoECLIP: Patch-Specialized Experts for Zero-shot Anomaly Detection

arXiv - AI · 4 min · 27 days ago

Machine Learning

[2603.03075] TinyIceNet: Low-Power SAR Sea Ice Segmentation for On-Board FPGA Inference

Abstract page for arXiv paper 2603.03075: TinyIceNet: Low-Power SAR Sea Ice Segmentation for On-Board FPGA Inference

arXiv - AI · 4 min · 27 days ago

Generative AI

[2603.03074] Design Generative AI for Practitioners: Exploring Interaction Approaches Aligned with Creative Practice

Abstract page for arXiv paper 2603.03074: Design Generative AI for Practitioners: Exploring Interaction Approaches Aligned with Creative ...

arXiv - AI · 3 min · 27 days ago

LLMs

[2603.03047] TrustMH-Bench: A Comprehensive Benchmark for Evaluating the Trustworthiness of Large Language Models in Mental Health

Abstract page for arXiv paper 2603.03047: TrustMH-Bench: A Comprehensive Benchmark for Evaluating the Trustworthiness of Large Language M...

arXiv - AI · 4 min · 27 days ago

Machine Learning

[2603.02961] Delegation and Verification Under AI

Abstract page for arXiv paper 2603.02961: Delegation and Verification Under AI

arXiv - AI · 3 min · 27 days ago

LLMs

[2603.02949] SEALing the Gap: A Reference Framework for LLM Inference Carbon Estimation via Multi-Benchmark Driven Embodiment

Abstract page for arXiv paper 2603.02949: SEALing the Gap: A Reference Framework for LLM Inference Carbon Estimation via Multi-Benchmark ...

arXiv - AI · 3 min · 27 days ago

