AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

Machine Learning

The AI Chip War is Just Getting Started

Everyone talks about AI models, but the real bottleneck might be hardware. According to a recent study by Roots Analysis: AI chip market ...

Reddit - Artificial Intelligence · 1 min · 14 minutes ago

AI Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 6 hours ago

LLMs

[2603.16430] EngGPT2: Sovereign, Efficient and Open Intelligence

Abstract page for arXiv paper 2603.16430: EngGPT2: Sovereign, Efficient and Open Intelligence

arXiv - AI · 4 min · about 8 hours ago

All Content

Machine Learning

[2510.14894] Secure Sparse Matrix Multiplications and their Applications to Privacy-Preserving Machine Learning

Abstract page for arXiv paper 2510.14894: Secure Sparse Matrix Multiplications and their Applications to Privacy-Preserving Machine Learning

arXiv - Machine Learning · 4 min · 27 days ago

LLMs

[2510.08946] Physically Valid Biomolecular Interaction Modeling with Gauss-Seidel Projection

Abstract page for arXiv paper 2510.08946: Physically Valid Biomolecular Interaction Modeling with Gauss-Seidel Projection

arXiv - Machine Learning · 4 min · 27 days ago

LLMs

[2510.03215] Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Abstract page for arXiv paper 2510.03215: Cache-to-Cache: Direct Semantic Communication Between Large Language Models

arXiv - Machine Learning · 4 min · 27 days ago

AI Infrastructure

[2506.08862] StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams

Abstract page for arXiv paper 2506.08862: StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams

arXiv - Machine Learning · 4 min · 27 days ago

LLMs

[2505.05619] LiteLMGuard: Seamless and Lightweight On-Device Prompt Filtering for Safeguarding Small Language Models against Quantization-induced Risks and Vulnerabilities

Abstract page for arXiv paper 2505.05619: LiteLMGuard: Seamless and Lightweight On-Device Prompt Filtering for Safeguarding Small Languag...

arXiv - Machine Learning · 4 min · 27 days ago

LLMs

[2404.02138] Topic-Based Watermarks for Large Language Models

Abstract page for arXiv paper 2404.02138: Topic-Based Watermarks for Large Language Models

arXiv - Machine Learning · 4 min · 27 days ago

AI Infrastructure

[2602.06823] AI-Generated Music Detection in Broadcast Monitoring

Abstract page for arXiv paper 2602.06823: AI-Generated Music Detection in Broadcast Monitoring

arXiv - AI · 4 min · 27 days ago

Machine Learning

[2210.09709] Importance Weighting Correction of Regularized Least-Squares for Target Shift

Abstract page for arXiv paper 2210.09709: Importance Weighting Correction of Regularized Least-Squares for Target Shift

arXiv - Machine Learning · 4 min · 27 days ago

LLMs

[2511.12832] From Passive to Persuasive: Steering Emotional Nuance in Human-AI Negotiation

Abstract page for arXiv paper 2511.12832: From Passive to Persuasive: Steering Emotional Nuance in Human-AI Negotiation

arXiv - AI · 3 min · 27 days ago

LLMs

[2510.14686] xLLM Technical Report

Abstract page for arXiv paper 2510.14686: xLLM Technical Report

arXiv - AI · 4 min · 27 days ago

LLMs

[2510.14086] Every Language Model Has a Forgery-Resistant Signature

Abstract page for arXiv paper 2510.14086: Every Language Model Has a Forgery-Resistant Signature

arXiv - AI · 4 min · 27 days ago

LLMs

[2510.06084] Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability

Abstract page for arXiv paper 2510.06084: Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability

arXiv - AI · 4 min · 27 days ago

Machine Learning

[2602.00130] On the Relationship Between Representation Geometry and Generalization in Deep Neural Networks

Abstract page for arXiv paper 2602.00130: On the Relationship Between Representation Geometry and Generalization in Deep Neural Networks

arXiv - Machine Learning · 3 min · 27 days ago

LLMs

[2509.21091] Best-of-$\infty$ -- Asymptotic Performance of Test-Time Compute

Abstract page for arXiv paper 2509.21091: Best-of-$\infty$ -- Asymptotic Performance of Test-Time Compute

arXiv - AI · 3 min · 27 days ago

LLMs

[2509.12610] ScaleDoc: Scaling LLM-based Predicates over Large Document Collections

Abstract page for arXiv paper 2509.12610: ScaleDoc: Scaling LLM-based Predicates over Large Document Collections

arXiv - Machine Learning · 4 min · 27 days ago

AI Infrastructure

[2509.11663] ConEQsA: Concurrent and Asynchronous Embodied Questions Scheduling and Answering

Abstract page for arXiv paper 2509.11663: ConEQsA: Concurrent and Asynchronous Embodied Questions Scheduling and Answering

arXiv - AI · 4 min · 27 days ago

Machine Learning

[2512.00272] WARP: Weight Teleportation for Attack-Resilient Unlearning Protocols

Abstract page for arXiv paper 2512.00272: WARP: Weight Teleportation for Attack-Resilient Unlearning Protocols

arXiv - Machine Learning · 4 min · 27 days ago

LLMs

[2506.17871] LLM Probability Concentration: How Alignment Shrinks the Generative Horizon

Abstract page for arXiv paper 2506.17871: LLM Probability Concentration: How Alignment Shrinks the Generative Horizon

arXiv - Machine Learning · 4 min · 27 days ago

LLMs

[2510.08646] Mitigating Over-Refusal in Aligned Large Language Models via Inference-Time Activation Energy

Abstract page for arXiv paper 2510.08646: Mitigating Over-Refusal in Aligned Large Language Models via Inference-Time Activation Energy

arXiv - Machine Learning · 4 min · 27 days ago

AI Infrastructure

[2510.08382] Characterizing the Multiclass Learnability of Forgiving 0-1 Loss Functions

Abstract page for arXiv paper 2510.08382: Characterizing the Multiclass Learnability of Forgiving 0-1 Loss Functions

arXiv - Machine Learning · 3 min · 27 days ago
