AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

[D] thoughts on the controversy about Google's new paper?

Openreview: https://openreview.net/forum?id=tO3ASKZlok It's sad to see almost no one mention this on Reddit and people are being mean to ...

Reddit - Machine Learning · 1 min · about 2 hours ago

Machine Learning

[D] MXFP8 GEMM: Up to 99% of cuBLAS performance using CUDA + PTX

New blog post by Daniel Vega-Myhre (Meta/PyTorch) illustrating GEMM design for FP8, including deep-dives into all the constraints and des...

Reddit - Machine Learning · 1 min · about 4 hours ago

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 5 hours ago

All Content

Llms

[2603.18048] DEAF: A Benchmark for Diagnostic Evaluation of Acoustic Faithfulness in Audio Language Models

Abstract page for arXiv paper 2603.18048: DEAF: A Benchmark for Diagnostic Evaluation of Acoustic Faithfulness in Audio Language Models

arXiv - AI · 4 min · 7 days ago

Machine Learning

[2511.17038] DAPS++: Rethinking Diffusion Inverse Problems with Decoupled Posterior Annealing

Abstract page for arXiv paper 2511.17038: DAPS++: Rethinking Diffusion Inverse Problems with Decoupled Posterior Annealing

arXiv - AI · 3 min · 7 days ago

Ai Safety

[2509.19464] Evaluation-Aware Reinforcement Learning

Abstract page for arXiv paper 2509.19464: Evaluation-Aware Reinforcement Learning

arXiv - Machine Learning · 3 min · 7 days ago

Llms

[2603.20180] Adaptive Greedy Frame Selection for Long Video Understanding

Abstract page for arXiv paper 2603.20180: Adaptive Greedy Frame Selection for Long Video Understanding

arXiv - AI · 4 min · 7 days ago

Llms

[2603.20161] Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models

Abstract page for arXiv paper 2603.20161: Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models

arXiv - Machine Learning · 3 min · 7 days ago

Machine Learning

[2603.19970] Graph2TS: Structure-Controlled Time Series Generation via Quantile-Graph VAEs

Abstract page for arXiv paper 2603.19970: Graph2TS: Structure-Controlled Time Series Generation via Quantile-Graph VAEs

arXiv - AI · 4 min · 7 days ago

Llms

[2603.19957] HiPath: Hierarchical Vision-Language Alignment for Structured Pathology Report Prediction

Abstract page for arXiv paper 2603.19957: HiPath: Hierarchical Vision-Language Alignment for Structured Pathology Report Prediction

arXiv - Machine Learning · 3 min · 7 days ago

Ai Infrastructure

[2603.19918] Learning Like Humans: Analogical Concept Learning for Generalized Category Discovery

Abstract page for arXiv paper 2603.19918: Learning Like Humans: Analogical Concept Learning for Generalized Category Discovery

arXiv - AI · 3 min · 7 days ago

Machine Learning

[2603.19757] Uncertainty-aware Prototype Learning with Variational Inference for Few-shot Point Cloud Segmentation

Abstract page for arXiv paper 2603.19757: Uncertainty-aware Prototype Learning with Variational Inference for Few-shot Point Cloud Segmen...

arXiv - AI · 4 min · 7 days ago

Machine Learning

[2603.19664] The Residual Stream Is All You Need: On the Redundancy of the KV Cache in Transformer Inference

Abstract page for arXiv paper 2603.19664: The Residual Stream Is All You Need: On the Redundancy of the KV Cache in Transformer Inference

arXiv - AI · 4 min · 7 days ago

Machine Learning

[2603.19643] OmniDiT: Extending Diffusion Transformer to Omni-VTON Framework

Abstract page for arXiv paper 2603.19643: OmniDiT: Extending Diffusion Transformer to Omni-VTON Framework

arXiv - AI · 4 min · 7 days ago

Generative Ai

[2603.19634] MetaCues: Enabling Critical Engagement with Generative AI for Information Seeking and Sensemaking

Abstract page for arXiv paper 2603.19634: MetaCues: Enabling Critical Engagement with Generative AI for Information Seeking and Sensemaking

arXiv - AI · 4 min · 7 days ago

Machine Learning

[2603.19594] ARMOR: Adaptive Resilience Against Model Poisoning Attacks in Continual Federated Learning for Mobile Indoor Localization

Abstract page for arXiv paper 2603.19594: ARMOR: Adaptive Resilience Against Model Poisoning Attacks in Continual Federated Learning for ...

arXiv - AI · 4 min · 7 days ago

Llms

[2603.19565] PFM-VEPAR: Prompting Foundation Models for RGB-Event Camera based Pedestrian Attribute Recognition

Abstract page for arXiv paper 2603.19565: PFM-VEPAR: Prompting Foundation Models for RGB-Event Camera based Pedestrian Attribute Recognition

arXiv - Machine Learning · 4 min · 7 days ago

Machine Learning

[2603.19563] Dual-Domain Representation Alignment: Bridging 2D and 3D Vision via Geometry-Aware Architecture Search

Abstract page for arXiv paper 2603.19563: Dual-Domain Representation Alignment: Bridging 2D and 3D Vision via Geometry-Aware Architecture...

arXiv - AI · 4 min · 7 days ago

Llms

[2603.19531] dinov3.seg: Open-Vocabulary Semantic Segmentation with DINOv3

Abstract page for arXiv paper 2603.19531: dinov3.seg: Open-Vocabulary Semantic Segmentation with DINOv3

arXiv - AI · 4 min · 7 days ago

Llms

[2603.19519] Inducing Sustained Creativity and Diversity in Large Language Models

Abstract page for arXiv paper 2603.19519: Inducing Sustained Creativity and Diversity in Large Language Models

arXiv - AI · 4 min · 7 days ago

Llms

[2603.19470] Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL

Abstract page for arXiv paper 2603.19470: Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL

arXiv - AI · 4 min · 7 days ago

Machine Learning

[2603.19290] Neural Dynamics Self-Attention for Spiking Transformers

Abstract page for arXiv paper 2603.19290: Neural Dynamics Self-Attention for Spiking Transformers

arXiv - AI · 4 min · 7 days ago

Llms

[2603.19289] Speculating Experts Accelerates Inference for Mixture-of-Experts

Abstract page for arXiv paper 2603.19289: Speculating Experts Accelerates Inference for Mixture-of-Experts

arXiv - AI · 3 min · 7 days ago

Previous Page 24 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

[D] thoughts on the controversy about Google's new paper?

[D] MXFP8 GEMM: Up to 99% of cuBLAS performance using CUDA + PTX

UMKC Announces New Master of Science in Artificial Intelligence

All Content

[2603.18048] DEAF: A Benchmark for Diagnostic Evaluation of Acoustic Faithfulness in Audio Language Models

[2511.17038] DAPS++: Rethinking Diffusion Inverse Problems with Decoupled Posterior Annealing

[2509.19464] Evaluation-Aware Reinforcement Learning

[2603.20180] Adaptive Greedy Frame Selection for Long Video Understanding

[2603.20161] Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models

[2603.19970] Graph2TS: Structure-Controlled Time Series Generation via Quantile-Graph VAEs

[2603.19957] HiPath: Hierarchical Vision-Language Alignment for Structured Pathology Report Prediction

[2603.19918] Learning Like Humans: Analogical Concept Learning for Generalized Category Discovery

[2603.19757] Uncertainty-aware Prototype Learning with Variational Inference for Few-shot Point Cloud Segmentation

[2603.19664] The Residual Stream Is All You Need: On the Redundancy of the KV Cache in Transformer Inference

[2603.19643] OmniDiT: Extending Diffusion Transformer to Omni-VTON Framework

[2603.19634] MetaCues: Enabling Critical Engagement with Generative AI for Information Seeking and Sensemaking

[2603.19594] ARMOR: Adaptive Resilience Against Model Poisoning Attacks in Continual Federated Learning for Mobile Indoor Localization

[2603.19565] PFM-VEPAR: Prompting Foundation Models for RGB-Event Camera based Pedestrian Attribute Recognition

[2603.19563] Dual-Domain Representation Alignment: Bridging 2D and 3D Vision via Geometry-Aware Architecture Search

[2603.19531] dinov3.seg: Open-Vocabulary Semantic Segmentation with DINOv3

[2603.19519] Inducing Sustained Creativity and Diversity in Large Language Models

[2603.19470] Adaptive Layerwise Perturbation: Unifying Off-Policy Corrections for LLM RL

[2603.19290] Neural Dynamics Self-Attention for Spiking Transformers

[2603.19289] Speculating Experts Accelerates Inference for Mixture-of-Experts

Related Topics

Stay updated with AI News