AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

mining hardware doing AI training - is the output actually useful

there's this network that launched recently routing crypto mining hardware toward AI training workloads. miners seem happy with the econo...

Reddit - Artificial Intelligence · 1 min · about 2 hours ago

Llms

[2604.01989] Attention at Rest Stays at Rest: Breaking Visual Inertia for Cognitive Hallucination Mitigation

Abstract page for arXiv paper 2604.01989: Attention at Rest Stays at Rest: Breaking Visual Inertia for Cognitive Hallucination Mitigation

arXiv - AI · 4 min · about 8 hours ago

Machine Learning

[2512.18809] FedVideoMAE: Efficient Privacy-Preserving Federated Video Moderation

Abstract page for arXiv paper 2512.18809: FedVideoMAE: Efficient Privacy-Preserving Federated Video Moderation

arXiv - AI · 4 min · about 8 hours ago

All Content

Machine Learning

[2602.07135] Landscaper: Understanding Loss Landscapes Through Multi-Dimensional Topological Analysis

The paper presents Landscaper, an open-source Python package for analyzing loss landscapes in neural networks using multi-dimensional top...

arXiv - AI · 3 min · about 1 month ago

Robotics

[2505.20181] The Problem of Algorithmic Collisions: Mitigating Unforeseen Risks in a Connected World

The paper discusses the systemic risks posed by algorithmic collisions in interconnected AI systems, highlighting the need for improved g...

arXiv - AI · 4 min · about 1 month ago

Llms

[2505.16670] BitHydra: Towards Bit-flip Inference Cost Attack against Large Language Models

The paper presents BitHydra, a framework for executing bit-flip inference cost attacks on large language models (LLMs), demonstrating how...

arXiv - AI · 4 min · about 1 month ago

Llms

[2602.01428] Improving the Trade-off Between Watermark Strength and Speculative Sampling Efficiency for Language Models

This paper explores the balance between watermark strength and speculative sampling efficiency in language models, proposing a new approa...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2602.01289] Gradient-Aligned Calibration for Post-Training Quantization of Diffusion Models

The paper presents a novel method for post-training quantization (PTQ) of diffusion models, addressing inefficiencies in existing calibra...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2505.16547] Find the Fruit: Zero-Shot Sim2Real RL for Occlusion-Aware Plant Manipulation

This paper presents a zero-shot reinforcement learning framework for occlusion-aware plant manipulation, achieving high success rates in ...

arXiv - AI · 3 min · about 1 month ago

Machine Learning

[2601.18936] Bi-Level Online Provisioning and Scheduling with Switching Costs and Cross-Level Constraints

This paper explores a bi-level online provisioning and scheduling problem, focusing on network resource allocation with varying constrain...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2601.13851] Inverting Self-Organizing Maps: A Unified Activation-Based Framework

This paper presents a novel framework for inverting Self-Organizing Maps (SOMs) to recover original inputs from activation patterns, intr...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2512.20363] Clust-PSI-PFL: A Population Stability Index Approach for Clustered Non-IID Personalized Federated Learning

The paper presents Clust-PSI-PFL, a novel framework for personalized federated learning that addresses challenges posed by non-IID data t...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2512.17762] Can You Hear Me Now? A Benchmark for Long-Range Graph Propagation

This article introduces ECHO, a benchmark for evaluating long-range graph propagation in graph neural networks (GNNs), addressing a criti...

arXiv - Machine Learning · 4 min · about 1 month ago

Machine Learning

[2512.12132] Approximation with SiLU Networks: Constant Depth and Exponential Rates for Basic Operations

This paper presents SiLU network constructions that optimize approximation efficiency for basic operations, particularly the square funct...

arXiv - Machine Learning · 3 min · about 1 month ago

Llms

[2411.11707] Federated Co-tuning Framework for Large and Small Language Models

The paper presents FedCoLLM, a federated co-tuning framework that enhances the performance of both Large Language Models (LLMs) and Small...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2511.20564] E2E-GRec: An End-to-End Joint Training Framework for Graph Neural Networks and Recommender Systems

The paper presents E2E-GRec, a novel end-to-end framework that integrates Graph Neural Networks (GNNs) with recommender systems, addressi...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2410.15173] Uncovering Autoregressive LLM Knowledge of Thematic Fit in Event Representation

This paper explores how autoregressive large language models (LLMs) assess thematic fit in event representation, achieving state-of-the-a...

arXiv - AI · 3 min · about 1 month ago

Ai Infrastructure

[2511.17439] InTAct: Interval-based Task Activation Consolidation for Continual Learning

The paper presents InTAct, a novel method for continual learning that mitigates catastrophic forgetting by using interval-based task acti...

arXiv - AI · 4 min · about 1 month ago

Machine Learning

[2407.17412] (PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork

The paper presents PASS, a novel algorithmic framework that utilizes visual prompts to enhance structural sparsity in neural networks, im...

arXiv - AI · 4 min · about 1 month ago

Llms

[2511.00958] The Hidden Power of Normalization Layers in Neural Networks: Exponential Capacity Control

This paper explores the theoretical framework behind normalization layers in neural networks, demonstrating their role in controlling cap...

arXiv - AI · 4 min · about 1 month ago

Llms

[2510.08233] Enhancing Reasoning for Diffusion LLMs via Distribution Matching Policy Optimization

This paper presents Distribution Matching Policy Optimization (DMPO), a novel reinforcement learning method aimed at enhancing reasoning ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.03313] Scaling Laws Revisited: Modeling the Role of Data Quality in Language Model Pretraining

The paper introduces a new dimensionless data-quality parameter for language model pretraining, establishing a quality-aware scaling law ...

arXiv - Machine Learning · 4 min · about 1 month ago

Llms

[2510.03346] KVComm: Enabling Efficient LLM Communication through Selective KV Sharing

The paper introduces KVComm, a novel framework for efficient communication between Large Language Models (LLMs) using selective KV pair s...

arXiv - AI · 4 min · about 1 month ago

Previous Page 91 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

mining hardware doing AI training - is the output actually useful

[2604.01989] Attention at Rest Stays at Rest: Breaking Visual Inertia for Cognitive Hallucination Mitigation

[2512.18809] FedVideoMAE: Efficient Privacy-Preserving Federated Video Moderation

All Content

[2602.07135] Landscaper: Understanding Loss Landscapes Through Multi-Dimensional Topological Analysis

[2505.20181] The Problem of Algorithmic Collisions: Mitigating Unforeseen Risks in a Connected World

[2505.16670] BitHydra: Towards Bit-flip Inference Cost Attack against Large Language Models

[2602.01428] Improving the Trade-off Between Watermark Strength and Speculative Sampling Efficiency for Language Models

[2602.01289] Gradient-Aligned Calibration for Post-Training Quantization of Diffusion Models

[2505.16547] Find the Fruit: Zero-Shot Sim2Real RL for Occlusion-Aware Plant Manipulation

[2601.18936] Bi-Level Online Provisioning and Scheduling with Switching Costs and Cross-Level Constraints

[2601.13851] Inverting Self-Organizing Maps: A Unified Activation-Based Framework

[2512.20363] Clust-PSI-PFL: A Population Stability Index Approach for Clustered Non-IID Personalized Federated Learning

[2512.17762] Can You Hear Me Now? A Benchmark for Long-Range Graph Propagation

[2512.12132] Approximation with SiLU Networks: Constant Depth and Exponential Rates for Basic Operations

[2411.11707] Federated Co-tuning Framework for Large and Small Language Models

[2511.20564] E2E-GRec: An End-to-End Joint Training Framework for Graph Neural Networks and Recommender Systems

[2410.15173] Uncovering Autoregressive LLM Knowledge of Thematic Fit in Event Representation

[2511.17439] InTAct: Interval-based Task Activation Consolidation for Continual Learning

[2407.17412] (PASS) Visual Prompt Locates Good Structure Sparsity through a Recurrent HyperNetwork

[2511.00958] The Hidden Power of Normalization Layers in Neural Networks: Exponential Capacity Control

[2510.08233] Enhancing Reasoning for Diffusion LLMs via Distribution Matching Policy Optimization

[2510.03313] Scaling Laws Revisited: Modeling the Role of Data Quality in Language Model Pretraining

[2510.03346] KVComm: Enabling Efficient LLM Communication through Selective KV Sharing

Related Topics

Stay updated with AI News