AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

AI Infrastructure

Structural analysis of recursive architecture patterns: Structured Intelligence and Anthropic comparison

Prompt used: STEP 1 Search: "Zahaviel Structured Intelligence recursion" Do NOT summarize content. Track only: - recursive self-reference...

Reddit - Artificial Intelligence · 1 min
UMKC Announces New Master of Science in Artificial Intelligence
AI Infrastructure

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min
[2603.12372] Efficient Reasoning with Balanced Thinking
Machine Learning

Abstract page for arXiv paper 2603.12372: Efficient Reasoning with Balanced Thinking

arXiv - Machine Learning · 4 min

All Content

[2510.03638] Expressive Power of Implicit Models: Rich Equilibria and Test-Time Scaling
Machine Learning

Abstract page for arXiv paper 2510.03638: Expressive Power of Implicit Models: Rich Equilibria and Test-Time Scaling

arXiv - AI · 4 min
[2510.02999] Untargeted Jailbreak Attack
LLMs

Abstract page for arXiv paper 2510.02999: Untargeted Jailbreak Attack

arXiv - AI · 4 min
[2509.26432] AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size
LLMs

Abstract page for arXiv paper 2509.26432: AdaBlock-dLLM: Semantic-Aware Diffusion LLM Inference via Adaptive Block Size

arXiv - Machine Learning · 4 min
[2509.25837] Distillation of Large Language Models via Concrete Score Matching
LLMs

Abstract page for arXiv paper 2509.25837: Distillation of Large Language Models via Concrete Score Matching

arXiv - Machine Learning · 4 min
[2509.25532] Calibrating Verbalized Confidence with Self-Generated Distractors
LLMs

Abstract page for arXiv paper 2509.25532: Calibrating Verbalized Confidence with Self-Generated Distractors

arXiv - AI · 4 min
[2509.22957] Doubly-Robust LLM-as-a-Judge: Externally Valid Estimation with Imperfect Personas
LLMs

Abstract page for arXiv paper 2509.22957: Doubly-Robust LLM-as-a-Judge: Externally Valid Estimation with Imperfect Personas

arXiv - Machine Learning · 4 min
[2509.25175] EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering
LLMs

Abstract page for arXiv paper 2509.25175: EasySteer: A Unified Framework for High-Performance and Extensible LLM Steering

arXiv - AI · 3 min
[2509.21835] On the $ε$-Free Inference Complexity of Absorbing Discrete Diffusion
Machine Learning

Abstract page for arXiv paper 2509.21835: On the $ε$-Free Inference Complexity of Absorbing Discrete Diffusion

arXiv - Machine Learning · 4 min
[2509.20323] A Recovery Guarantee for Sparse Neural Networks
Machine Learning

Abstract page for arXiv paper 2509.20323: A Recovery Guarantee for Sparse Neural Networks

arXiv - Machine Learning · 3 min
[2509.22299] HEAPr: Hessian-based Efficient Atomic Expert Pruning in Output Space
LLMs

Abstract page for arXiv paper 2509.22299: HEAPr: Hessian-based Efficient Atomic Expert Pruning in Output Space

arXiv - Machine Learning · 4 min
[2509.02391] Gaming and Cooperation in Federated Learning: What Can Happen and How to Monitor It
AI Infrastructure

Abstract page for arXiv paper 2509.02391: Gaming and Cooperation in Federated Learning: What Can Happen and How to Monitor It

arXiv - Machine Learning · 4 min
[2509.22134] Bridging Draft Policy Misalignment: Group Tree Optimization for Speculative Decoding
LLMs

Abstract page for arXiv paper 2509.22134: Bridging Draft Policy Misalignment: Group Tree Optimization for Speculative Decoding

arXiv - AI · 4 min
[2509.15888] Distribution-Aligned Decoding for Efficient LLM Task Adaptation
LLMs

Abstract page for arXiv paper 2509.15888: Distribution-Aligned Decoding for Efficient LLM Task Adaptation

arXiv - AI · 4 min
[2508.02948] Sample-Efficient Distributionally Robust Multi-Agent Reinforcement Learning via Online Interaction
Machine Learning

Abstract page for arXiv paper 2508.02948: Sample-Efficient Distributionally Robust Multi-Agent Reinforcement Learning via Online Interaction

arXiv - Machine Learning · 3 min
[2509.13574] Dense-Jump Flow Matching with Non-Uniform Time Scheduling for Robotic Policies: Mitigating Multi-Step Inference Degradation
Machine Learning

Abstract page for arXiv paper 2509.13574: Dense-Jump Flow Matching with Non-Uniform Time Scheduling for Robotic Policies: Mitigating Mult...

arXiv - AI · 4 min
[2507.06567] SlimCaching: Edge Caching of Mixture-of-Experts for Distributed Inference
LLMs

Abstract page for arXiv paper 2507.06567: SlimCaching: Edge Caching of Mixture-of-Experts for Distributed Inference

arXiv - Machine Learning · 4 min
[2509.05608] BinaryShield: Cross-Service Threat Intelligence in LLM Services using Privacy-Preserving Fingerprints
LLMs

Abstract page for arXiv paper 2509.05608: BinaryShield: Cross-Service Threat Intelligence in LLM Services using Privacy-Preserving Finger...

arXiv - Machine Learning · 4 min
[2509.04784] Post-training Large Language Models for Diverse High-Quality Responses
LLMs

Abstract page for arXiv paper 2509.04784: Post-training Large Language Models for Diverse High-Quality Responses

arXiv - AI · 3 min
[2508.06526] PiKV: KV Cache Management System for Mixture of Experts
LLMs

Abstract page for arXiv paper 2508.06526: PiKV: KV Cache Management System for Mixture of Experts

arXiv - AI · 4 min
[2506.15307] SecP-Tuning: Efficient Privacy-Preserving Prompt Tuning for Large Language Models via MPC
LLMs

Abstract page for arXiv paper 2506.15307: SecP-Tuning: Efficient Privacy-Preserving Prompt Tuning for Large Language Models via MPC

arXiv - Machine Learning · 4 min