AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

Integrate Physical AI Capabilities into Existing Apps with NVIDIA Omniverse Libraries

Physical AI—AI systems that perceive, reason, and act in physically grounded simulated environments—is changing how teams design and vali...

AI Tools & Products · 14 min · about 2 hours ago

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 3 hours ago

Ai Infrastructure

Dell and HIVE partner to deploy Nvidia’s next-generation AI chips

AI News - General · 1 min · about 3 hours ago

All Content

Machine Learning

[2602.16690] Synthetic-Powered Multiple Testing with FDR Control

The paper presents SynthBH, a novel method for multiple hypothesis testing that integrates synthetic data to enhance statistical inferenc...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2502.01160] Scalable Precise Computation of Shannon Entropy

This paper presents a scalable tool, PSE, for precise computation of Shannon entropy, optimizing the process to enhance efficiency in qua...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.16660] Align Once, Benefit Multilingually: Enforcing Multilingual Consistency for LLM Safety Alignment

The paper presents a method for enhancing multilingual safety alignment in large language models (LLMs) using a resource-efficient Multi-...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.16612] Causal and Compositional Abstraction

The paper presents a formal framework for causal and compositional abstraction, emphasizing its significance in AI and scientific practic...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.16603] FlowPrefill: Decoupling Preemption from Prefill Scheduling Granularity to Mitigate Head-of-Line Blocking in LLM Serving

The paper presents FlowPrefill, a novel system designed to optimize large language model (LLM) serving by decoupling preemption from sche...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.16585] DataJoint 2.0: A Computational Substrate for Agentic Scientific Workflows

DataJoint 2.0 introduces a relational workflow model designed to enhance collaboration in scientific data pipelines, ensuring data integr...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.16320] RefineFormer3D: Efficient 3D Medical Image Segmentation via Adaptive Multi-Scale Transformer with Cross Attention Fusion

RefineFormer3D presents a lightweight transformer architecture for 3D medical image segmentation, achieving high accuracy with significan...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.16233] DistributedEstimator: Distributed Training of Quantum Neural Networks via Circuit Cutting

The paper presents a novel approach to distributed training of quantum neural networks using circuit cutting, addressing overheads and pe...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.16430] Designing Production-Scale OCR for India: Multilingual and Domain-Specific Systems

This article discusses the development of production-scale Optical Character Recognition (OCR) systems tailored for India's multilingual ...

arXiv - AI · 3 min · about 2 months ago

Nlp

[2602.16148] Local adapt-then-combine algorithms for distributed nonsmooth optimization: Achieving provable communication acceleration

This paper introduces FlexATC, a communication-efficient framework for distributed nonsmooth optimization, achieving notable convergence ...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.16132] CHAI: CacHe Attention Inference for text2video

The paper presents CHAI, a novel approach to enhance text-to-video generation by utilizing Cache Attention for efficient inference, achie...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.16113] Evolutionary Context Search for Automated Skill Acquisition

The paper presents Evolutionary Context Search (ECS), a novel method for automated skill acquisition in large language models, enhancing ...

arXiv - Machine Learning · 3 min · about 2 months ago

Nlp

[2602.16086] LGQ: Learning Discretization Geometry for Scalable and Stable Image Tokenization

The paper presents LGQ, a novel image tokenizer that learns discretization geometry to enhance scalability and stability in visual genera...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.16054] CLAA: Cross-Layer Attention Aggregation for Accelerating LLM Prefill

The paper introduces Cross-Layer Attention Aggregation (CLAA) to enhance the efficiency of long-context LLM inference by addressing token...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.16174] Edge Learning via Federated Split Decision Transformers for Metaverse Resource Allocation

The paper presents Federated Split Decision Transformers (FSDT) for optimizing resource allocation in mobile edge computing for the metav...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.15996] Exploring New Frontiers in Vertical Federated Learning: the Role of Saddle Point Reformulation

This paper explores saddle point reformulation in Vertical Federated Learning (VFL), presenting methods for efficient model training acro...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.16136] Retrieval Collapses When AI Pollutes the Web

The paper discusses the phenomenon of 'Retrieval Collapse,' where AI-generated content dominates search results, leading to a decline in ...

arXiv - AI · 3 min · about 2 months ago

Nlp

[2602.16124] Rethinking ANN-based Retrieval: Multifaceted Learnable Index for Large-scale Recommendation System

The paper presents a novel approach called MultiFaceted Learnable Index (MFLI) for enhancing ANN-based retrieval in large-scale recommend...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.15894] Quality-constrained Entropy Maximization Policy Optimization for LLM Diversity

This paper presents Quality-constrained Entropy Maximization Policy Optimization (QEMPO), a method to enhance diversity in large language...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.15891] Learning to Drive in New Cities Without Human Demonstrations

This paper presents NOMAD, a novel approach for training autonomous vehicles to navigate new cities without relying on human driving demo...

arXiv - Machine Learning · 3 min · about 2 months ago

Previous Page 126 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

Integrate Physical AI Capabilities into Existing Apps with NVIDIA Omniverse Libraries

UMKC Announces New Master of Science in Artificial Intelligence

Dell and HIVE partner to deploy Nvidia’s next-generation AI chips

All Content

[2602.16690] Synthetic-Powered Multiple Testing with FDR Control

[2502.01160] Scalable Precise Computation of Shannon Entropy

[2602.16660] Align Once, Benefit Multilingually: Enforcing Multilingual Consistency for LLM Safety Alignment

[2602.16612] Causal and Compositional Abstraction

[2602.16603] FlowPrefill: Decoupling Preemption from Prefill Scheduling Granularity to Mitigate Head-of-Line Blocking in LLM Serving

[2602.16585] DataJoint 2.0: A Computational Substrate for Agentic Scientific Workflows

[2602.16320] RefineFormer3D: Efficient 3D Medical Image Segmentation via Adaptive Multi-Scale Transformer with Cross Attention Fusion

[2602.16233] DistributedEstimator: Distributed Training of Quantum Neural Networks via Circuit Cutting

[2602.16430] Designing Production-Scale OCR for India: Multilingual and Domain-Specific Systems

[2602.16148] Local adapt-then-combine algorithms for distributed nonsmooth optimization: Achieving provable communication acceleration

[2602.16132] CHAI: CacHe Attention Inference for text2video

[2602.16113] Evolutionary Context Search for Automated Skill Acquisition

[2602.16086] LGQ: Learning Discretization Geometry for Scalable and Stable Image Tokenization

[2602.16054] CLAA: Cross-Layer Attention Aggregation for Accelerating LLM Prefill

[2602.16174] Edge Learning via Federated Split Decision Transformers for Metaverse Resource Allocation

[2602.15996] Exploring New Frontiers in Vertical Federated Learning: the Role of Saddle Point Reformulation

[2602.16136] Retrieval Collapses When AI Pollutes the Web

[2602.16124] Rethinking ANN-based Retrieval: Multifaceted Learnable Index for Large-scale Recommendation System

[2602.15894] Quality-constrained Entropy Maximization Policy Optimization for LLM Diversity

[2602.15891] Learning to Drive in New Cities Without Human Demonstrations

Related Topics

Stay updated with AI News