AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

RSS

Top This Week

Ai Infrastructure

CUDA Proves Nvidia Is a Software Company | WIRED

There’s a deep, forbidding moat that surrounds Nvidia—and it has nothing to do with hardware.

Wired - AI · 9 min · about 4 hours ago

Llms

[2511.02805] MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning

Abstract page for arXiv paper 2511.02805: MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Lea...

arXiv - AI · 3 min · about 9 hours ago

Llms

[2510.22944] Is Your Prompt Poisoning Code? Defect Induction Rates and Security Mitigation Strategies

Abstract page for arXiv paper 2510.22944: Is Your Prompt Poisoning Code? Defect Induction Rates and Security Mitigation Strategies

arXiv - AI · 4 min · about 9 hours ago

All Content

Machine Learning

[2602.01621] CGF-Softmax: A Cumulant-Based Softmax Reformulation for Efficient Inference under Homomorphic Encryption

Abstract page for arXiv paper 2602.01621: CGF-Softmax: A Cumulant-Based Softmax Reformulation for Efficient Inference under Homomorphic E...

arXiv - Machine Learning · 3 min · about 11 hours ago

Llms

[2601.21839] Test-Time Compute Games

Abstract page for arXiv paper 2601.21839: Test-Time Compute Games

arXiv - AI · 3 min · about 11 hours ago

Llms

[2601.02602] SWaRL: Safeguard Code Watermarking via Reinforcement Learning

Abstract page for arXiv paper 2601.02602: SWaRL: Safeguard Code Watermarking via Reinforcement Learning

arXiv - Machine Learning · 3 min · about 11 hours ago

Machine Learning

[2511.09016] Assumed Density Filtering and Smoothing with Neural Network Surrogate Models

Abstract page for arXiv paper 2511.09016: Assumed Density Filtering and Smoothing with Neural Network Surrogate Models

arXiv - Machine Learning · 3 min · about 11 hours ago

Llms

[2509.25584] Skip-It? Theoretical Conditions for Layer Skipping in Vision-Language Models

Abstract page for arXiv paper 2509.25584: Skip-It? Theoretical Conditions for Layer Skipping in Vision-Language Models

arXiv - AI · 4 min · about 11 hours ago

Llms

[2604.08426] KV Cache Offloading for Context-Intensive Tasks

Abstract page for arXiv paper 2604.08426: KV Cache Offloading for Context-Intensive Tasks

arXiv - AI · 4 min · about 11 hours ago

Llms

[2602.09782] Flexible Entropy Control in RLVR with a Gradient-Preserving Perspective

Abstract page for arXiv paper 2602.09782: Flexible Entropy Control in RLVR with a Gradient-Preserving Perspective

arXiv - AI · 4 min · about 11 hours ago

Llms

[2602.06283] SOCKET: SOft Collision Kernel EsTimator for Sparse Attention

Abstract page for arXiv paper 2602.06283: SOCKET: SOft Collision Kernel EsTimator for Sparse Attention

arXiv - Machine Learning · 3 min · about 11 hours ago

Machine Learning

[2601.15127] DeepFedNAS: Efficient Hardware-Aware Architecture Adaptation for Heterogeneous IoT Federations via Pareto-Guided Supernet Training

Abstract page for arXiv paper 2601.15127: DeepFedNAS: Efficient Hardware-Aware Architecture Adaptation for Heterogeneous IoT Federations ...

arXiv - Machine Learning · 4 min · about 11 hours ago

Machine Learning

[2602.02832] Koopman Autoencoders with Continuous-Time Latent Dynamics for Fluid Dynamics Forecasting

Abstract page for arXiv paper 2602.02832: Koopman Autoencoders with Continuous-Time Latent Dynamics for Fluid Dynamics Forecasting

arXiv - Machine Learning · 4 min · about 11 hours ago

Llms

[2602.01003] ESSAM: A Novel Competitive Evolution Strategies Approach to Reinforcement Learning for Memory Efficient LLMs Fine-Tuning

Abstract page for arXiv paper 2602.01003: ESSAM: A Novel Competitive Evolution Strategies Approach to Reinforcement Learning for Memory E...

arXiv - AI · 4 min · about 11 hours ago

Llms

[2602.00513] Minerva: Reinforcement Learning with Verifiable Rewards for Cyber Threat Intelligence LLMs

Abstract page for arXiv paper 2602.00513: Minerva: Reinforcement Learning with Verifiable Rewards for Cyber Threat Intelligence LLMs

arXiv - Machine Learning · 4 min · about 11 hours ago

Machine Learning

[2601.18681] ART for Diffusion Sampling: A Reinforcement Learning Approach to Timestep Schedule

Abstract page for arXiv paper 2601.18681: ART for Diffusion Sampling: A Reinforcement Learning Approach to Timestep Schedule

arXiv - AI · 4 min · about 11 hours ago

Machine Learning

[2510.01290] ThinKV: Thought-Adaptive KV Cache Compression for Efficient Reasoning Models

Abstract page for arXiv paper 2510.01290: ThinKV: Thought-Adaptive KV Cache Compression for Efficient Reasoning Models

arXiv - Machine Learning · 3 min · about 11 hours ago

Llms

[2509.05276] SpikingBrain: Spiking Brain-inspired Large Models

Abstract page for arXiv paper 2509.05276: SpikingBrain: Spiking Brain-inspired Large Models

arXiv - AI · 4 min · about 11 hours ago

Machine Learning

[2506.14951] Flat Channels to Infinity in Neural Loss Landscapes

Abstract page for arXiv paper 2506.14951: Flat Channels to Infinity in Neural Loss Landscapes

arXiv - AI · 4 min · about 11 hours ago

Machine Learning

[2605.08035] PropSplat: Map-Free RF Field Reconstruction via 3D Gaussian Propagation Splatting

Abstract page for arXiv paper 2605.08035: PropSplat: Map-Free RF Field Reconstruction via 3D Gaussian Propagation Splatting

arXiv - Machine Learning · 4 min · about 11 hours ago

Llms

[2605.07850] MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

Abstract page for arXiv paper 2605.07850: MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

arXiv - AI · 4 min · about 11 hours ago

Machine Learning

[2605.07908] Statistical inference with belief functions: A survey

Abstract page for arXiv paper 2605.07908: Statistical inference with belief functions: A survey

arXiv - AI · 3 min · about 11 hours ago

Nlp

[2605.07549] Probabilistic Object Detection with Conformal Prediction

Abstract page for arXiv paper 2605.07549: Probabilistic Object Detection with Conformal Prediction

arXiv - Machine Learning · 4 min · about 11 hours ago

Previous Page 3 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

CUDA Proves Nvidia Is a Software Company | WIRED

[2511.02805] MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning

[2510.22944] Is Your Prompt Poisoning Code? Defect Induction Rates and Security Mitigation Strategies

All Content

[2602.01621] CGF-Softmax: A Cumulant-Based Softmax Reformulation for Efficient Inference under Homomorphic Encryption

[2601.21839] Test-Time Compute Games

[2601.02602] SWaRL: Safeguard Code Watermarking via Reinforcement Learning

[2511.09016] Assumed Density Filtering and Smoothing with Neural Network Surrogate Models

[2509.25584] Skip-It? Theoretical Conditions for Layer Skipping in Vision-Language Models

[2604.08426] KV Cache Offloading for Context-Intensive Tasks

[2602.09782] Flexible Entropy Control in RLVR with a Gradient-Preserving Perspective

[2602.06283] SOCKET: SOft Collision Kernel EsTimator for Sparse Attention

[2601.15127] DeepFedNAS: Efficient Hardware-Aware Architecture Adaptation for Heterogeneous IoT Federations via Pareto-Guided Supernet Training

[2602.02832] Koopman Autoencoders with Continuous-Time Latent Dynamics for Fluid Dynamics Forecasting

[2602.01003] ESSAM: A Novel Competitive Evolution Strategies Approach to Reinforcement Learning for Memory Efficient LLMs Fine-Tuning

[2602.00513] Minerva: Reinforcement Learning with Verifiable Rewards for Cyber Threat Intelligence LLMs

[2601.18681] ART for Diffusion Sampling: A Reinforcement Learning Approach to Timestep Schedule

[2510.01290] ThinKV: Thought-Adaptive KV Cache Compression for Efficient Reasoning Models

[2509.05276] SpikingBrain: Spiking Brain-inspired Large Models

[2506.14951] Flat Channels to Infinity in Neural Loss Landscapes

[2605.08035] PropSplat: Map-Free RF Field Reconstruction via 3D Gaussian Propagation Splatting

[2605.07850] MatryoshkaLoRA: Learning Accurate Hierarchical Low-Rank Representations for LLM Fine-Tuning

[2605.07908] Statistical inference with belief functions: A survey

[2605.07549] Probabilistic Object Detection with Conformal Prediction

Related Topics

Stay updated with AI News