AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 5 hours ago

Llms

[2604.07486] Private Seeds, Public LLMs: Realistic and Privacy-Preserving Synthetic Data Generation

Abstract page for arXiv paper 2604.07486: Private Seeds, Public LLMs: Realistic and Privacy-Preserving Synthetic Data Generation

arXiv - AI · 3 min · about 9 hours ago

Llms

[2601.14477] XD-MAP: Cross-Modal Domain Adaptation via Semantic Parametric Maps for Scalable Training Data Generation

Abstract page for arXiv paper 2601.14477: XD-MAP: Cross-Modal Domain Adaptation via Semantic Parametric Maps for Scalable Training Data G...

arXiv - AI · 4 min · about 9 hours ago

All Content

Llms

[2602.16113] Evolutionary Context Search for Automated Skill Acquisition

The paper presents Evolutionary Context Search (ECS), a novel method for automated skill acquisition in large language models, enhancing ...

arXiv - Machine Learning · 3 min · about 2 months ago

Nlp

[2602.16086] LGQ: Learning Discretization Geometry for Scalable and Stable Image Tokenization

The paper presents LGQ, a novel image tokenizer that learns discretization geometry to enhance scalability and stability in visual genera...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.16054] CLAA: Cross-Layer Attention Aggregation for Accelerating LLM Prefill

The paper introduces Cross-Layer Attention Aggregation (CLAA) to enhance the efficiency of long-context LLM inference by addressing token...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.16174] Edge Learning via Federated Split Decision Transformers for Metaverse Resource Allocation

The paper presents Federated Split Decision Transformers (FSDT) for optimizing resource allocation in mobile edge computing for the metav...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.15996] Exploring New Frontiers in Vertical Federated Learning: the Role of Saddle Point Reformulation

This paper explores saddle point reformulation in Vertical Federated Learning (VFL), presenting methods for efficient model training acro...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.16136] Retrieval Collapses When AI Pollutes the Web

The paper discusses the phenomenon of 'Retrieval Collapse,' where AI-generated content dominates search results, leading to a decline in ...

arXiv - AI · 3 min · about 2 months ago

Nlp

[2602.16124] Rethinking ANN-based Retrieval: Multifaceted Learnable Index for Large-scale Recommendation System

The paper presents a novel approach called MultiFaceted Learnable Index (MFLI) for enhancing ANN-based retrieval in large-scale recommend...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.15894] Quality-constrained Entropy Maximization Policy Optimization for LLM Diversity

This paper presents Quality-constrained Entropy Maximization Policy Optimization (QEMPO), a method to enhance diversity in large language...

arXiv - Machine Learning · 3 min · about 2 months ago

Machine Learning

[2602.15891] Learning to Drive in New Cities Without Human Demonstrations

This paper presents NOMAD, a novel approach for training autonomous vehicles to navigate new cities without relying on human driving demo...

arXiv - Machine Learning · 3 min · about 2 months ago

Robotics

[2602.16005] ODYN: An All-Shifted Non-Interior-Point Method for Quadratic Programming in Robotics and AI

The paper introduces ODYN, a novel non-interior-point method for quadratic programming, designed for efficiency in robotics and AI applic...

arXiv - AI · 3 min · about 2 months ago

Llms

[2602.15874] P-RAG: Prompt-Enhanced Parametric RAG with LoRA and Selective CoT for Biomedical and Multi-Hop QA

The paper introduces P-RAG, a novel hybrid architecture that enhances Retrieval-Augmented Generation (RAG) for biomedical question answer...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.15945] From Tool Orchestration to Code Execution: A Study of MCP Design Choices

This paper explores the design choices of Model Context Protocols (MCPs) and introduces Code Execution MCP (CE-MCP) as a solution to scal...

arXiv - AI · 4 min · about 2 months ago

Llms

[2602.16698] Causality is Key for Interpretability Claims to Generalise

This paper discusses the importance of causality in interpretability research for large language models, highlighting pitfalls in general...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.15919] Generalized Leverage Score for Scalable Assessment of Privacy Vulnerability

The paper presents a method for assessing privacy vulnerability in machine learning models using a generalized leverage score, enabling e...

arXiv - Machine Learning · 3 min · about 2 months ago

Llms

[2602.15902] Doc-to-LoRA: Learning to Instantly Internalize Contexts

The paper presents Doc-to-LoRA, a hypernetwork that enables Large Language Models to internalize contexts efficiently, reducing memory us...

arXiv - AI · 3 min · about 2 months ago

Machine Learning

[2602.16596] Sequential Membership Inference Attacks

The paper presents a novel approach to Membership Inference Attacks (MIAs) by developing an optimal attack strategy, SeMI*, leveraging mo...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.15889] Evidence for Daily and Weekly Periodic Variability in GPT-4o Performance

This article investigates the temporal variability in the performance of the GPT-4o model, revealing significant daily and weekly pattern...

arXiv - AI · 4 min · about 2 months ago

Machine Learning

[2602.15888] NeuroSleep: Neuromorphic Event-Driven Single-Channel EEG Sleep Staging for Edge-Efficient Sensing

NeuroSleep presents a neuromorphic event-driven system for efficient EEG sleep staging, achieving high accuracy with reduced computationa...

arXiv - Machine Learning · 4 min · about 2 months ago

Machine Learning

[2602.16570] Steering diffusion models with quadratic rewards: a fine-grained analysis

This article presents a detailed analysis of sampling from reward-tilted diffusion models, focusing on quadratic rewards and their comput...

arXiv - Machine Learning · 4 min · about 2 months ago

Llms

[2602.15862] Enhancing Action and Ingredient Modeling for Semantically Grounded Recipe Generation

This paper presents a novel framework for improving recipe generation from food images by enhancing action and ingredient modeling, addre...

arXiv - AI · 3 min · about 2 months ago

Previous Page 143 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence

[2604.07486] Private Seeds, Public LLMs: Realistic and Privacy-Preserving Synthetic Data Generation

[2601.14477] XD-MAP: Cross-Modal Domain Adaptation via Semantic Parametric Maps for Scalable Training Data Generation

All Content

[2602.16113] Evolutionary Context Search for Automated Skill Acquisition

[2602.16086] LGQ: Learning Discretization Geometry for Scalable and Stable Image Tokenization

[2602.16054] CLAA: Cross-Layer Attention Aggregation for Accelerating LLM Prefill

[2602.16174] Edge Learning via Federated Split Decision Transformers for Metaverse Resource Allocation

[2602.15996] Exploring New Frontiers in Vertical Federated Learning: the Role of Saddle Point Reformulation

[2602.16136] Retrieval Collapses When AI Pollutes the Web

[2602.16124] Rethinking ANN-based Retrieval: Multifaceted Learnable Index for Large-scale Recommendation System

[2602.15894] Quality-constrained Entropy Maximization Policy Optimization for LLM Diversity

[2602.15891] Learning to Drive in New Cities Without Human Demonstrations

[2602.16005] ODYN: An All-Shifted Non-Interior-Point Method for Quadratic Programming in Robotics and AI

[2602.15874] P-RAG: Prompt-Enhanced Parametric RAG with LoRA and Selective CoT for Biomedical and Multi-Hop QA

[2602.15945] From Tool Orchestration to Code Execution: A Study of MCP Design Choices

[2602.16698] Causality is Key for Interpretability Claims to Generalise

[2602.15919] Generalized Leverage Score for Scalable Assessment of Privacy Vulnerability

[2602.15902] Doc-to-LoRA: Learning to Instantly Internalize Contexts

[2602.16596] Sequential Membership Inference Attacks

[2602.15889] Evidence for Daily and Weekly Periodic Variability in GPT-4o Performance

[2602.15888] NeuroSleep: Neuromorphic Event-Driven Single-Channel EEG Sleep Staging for Edge-Efficient Sensing

[2602.16570] Steering diffusion models with quadratic rewards: a fine-grained analysis

[2602.15862] Enhancing Action and Ingredient Modeling for Semantically Grounded Recipe Generation

Related Topics

Stay updated with AI News