AI Infrastructure

GPUs, training clusters, MLOps, and deployment

Top This Week

Machine Learning

The AI Chip War is Just Getting Started

Everyone talks about AI models, but the real bottleneck might be hardware. According to a recent study by Roots Analysis: AI chip market ...

Reddit - Artificial Intelligence · 1 min ·
UMKC Announces New Master of Science in Artificial Intelligence
AI Infrastructure

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
[2603.16430] EngGPT2: Sovereign, Efficient and Open Intelligence
LLMs

Abstract page for arXiv paper 2603.16430: EngGPT2: Sovereign, Efficient and Open Intelligence

arXiv - AI · 4 min ·

All Content

[2510.14894] Secure Sparse Matrix Multiplications and their Applications to Privacy-Preserving Machine Learning
Machine Learning

Abstract page for arXiv paper 2510.14894: Secure Sparse Matrix Multiplications and their Applications to Privacy-Preserving Machine Learning

arXiv - Machine Learning · 4 min ·
[2510.08946] Physically Valid Biomolecular Interaction Modeling with Gauss-Seidel Projection
LLMs

Abstract page for arXiv paper 2510.08946: Physically Valid Biomolecular Interaction Modeling with Gauss-Seidel Projection

arXiv - Machine Learning · 4 min ·
[2510.03215] Cache-to-Cache: Direct Semantic Communication Between Large Language Models
LLMs

Abstract page for arXiv paper 2510.03215: Cache-to-Cache: Direct Semantic Communication Between Large Language Models

arXiv - Machine Learning · 4 min ·
[2506.08862] StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams
AI Infrastructure

Abstract page for arXiv paper 2506.08862: StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams

arXiv - Machine Learning · 4 min ·
[2505.05619] LiteLMGuard: Seamless and Lightweight On-Device Prompt Filtering for Safeguarding Small Language Models against Quantization-induced Risks and Vulnerabilities
LLMs

Abstract page for arXiv paper 2505.05619: LiteLMGuard: Seamless and Lightweight On-Device Prompt Filtering for Safeguarding Small Languag...

arXiv - Machine Learning · 4 min ·
[2404.02138] Topic-Based Watermarks for Large Language Models
LLMs

Abstract page for arXiv paper 2404.02138: Topic-Based Watermarks for Large Language Models

arXiv - Machine Learning · 4 min ·
[2602.06823] AI-Generated Music Detection in Broadcast Monitoring
AI Infrastructure

Abstract page for arXiv paper 2602.06823: AI-Generated Music Detection in Broadcast Monitoring

arXiv - AI · 4 min ·
[2210.09709] Importance Weighting Correction of Regularized Least-Squares for Target Shift
Machine Learning

Abstract page for arXiv paper 2210.09709: Importance Weighting Correction of Regularized Least-Squares for Target Shift

arXiv - Machine Learning · 4 min ·
[2511.12832] From Passive to Persuasive: Steering Emotional Nuance in Human-AI Negotiation
LLMs

Abstract page for arXiv paper 2511.12832: From Passive to Persuasive: Steering Emotional Nuance in Human-AI Negotiation

arXiv - AI · 3 min ·
[2510.14686] xLLM Technical Report
LLMs

Abstract page for arXiv paper 2510.14686: xLLM Technical Report

arXiv - AI · 4 min ·
[2510.14086] Every Language Model Has a Forgery-Resistant Signature
LLMs

Abstract page for arXiv paper 2510.14086: Every Language Model Has a Forgery-Resistant Signature

arXiv - AI · 4 min ·
[2510.06084] Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability
LLMs

Abstract page for arXiv paper 2510.06084: Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability

arXiv - AI · 4 min ·
[2602.00130] On the Relationship Between Representation Geometry and Generalization in Deep Neural Networks
Machine Learning

Abstract page for arXiv paper 2602.00130: On the Relationship Between Representation Geometry and Generalization in Deep Neural Networks

arXiv - Machine Learning · 3 min ·
[2509.21091] Best-of-$\infty$ -- Asymptotic Performance of Test-Time Compute
LLMs

Abstract page for arXiv paper 2509.21091: Best-of-$\infty$ -- Asymptotic Performance of Test-Time Compute

arXiv - AI · 3 min ·
[2509.12610] ScaleDoc: Scaling LLM-based Predicates over Large Document Collections
LLMs

Abstract page for arXiv paper 2509.12610: ScaleDoc: Scaling LLM-based Predicates over Large Document Collections

arXiv - Machine Learning · 4 min ·
[2509.11663] ConEQsA: Concurrent and Asynchronous Embodied Questions Scheduling and Answering
AI Infrastructure

Abstract page for arXiv paper 2509.11663: ConEQsA: Concurrent and Asynchronous Embodied Questions Scheduling and Answering

arXiv - AI · 4 min ·
[2512.00272] WARP: Weight Teleportation for Attack-Resilient Unlearning Protocols
Machine Learning

Abstract page for arXiv paper 2512.00272: WARP: Weight Teleportation for Attack-Resilient Unlearning Protocols

arXiv - Machine Learning · 4 min ·
[2506.17871] LLM Probability Concentration: How Alignment Shrinks the Generative Horizon
LLMs

Abstract page for arXiv paper 2506.17871: LLM Probability Concentration: How Alignment Shrinks the Generative Horizon

arXiv - Machine Learning · 4 min ·
[2510.08646] Mitigating Over-Refusal in Aligned Large Language Models via Inference-Time Activation Energy
LLMs

Abstract page for arXiv paper 2510.08646: Mitigating Over-Refusal in Aligned Large Language Models via Inference-Time Activation Energy

arXiv - Machine Learning · 4 min ·
[2510.08382] Characterizing the Multiclass Learnability of Forgiving 0-1 Loss Functions
AI Infrastructure

Abstract page for arXiv paper 2510.08382: Characterizing the Multiclass Learnability of Forgiving 0-1 Loss Functions

arXiv - Machine Learning · 3 min ·