AI Infrastructure

GPUs, training clusters, MLOps, and deployment

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

[P] Run Karpathy's Autoresearch for $0.44 instead of $24 — Open-source parallel evolution pipeline on SageMaker Spot

TL;DR: I built an open-source pipeline that runs Karpathy's autoresearch on SageMaker Spot instances — 25 autonomous ML experiments for $...

Reddit - Machine Learning · 1 min · about 2 hours ago

Ai Infrastructure

Nvidia’s Jensen Huang says ‘We’ve achieved AGI.’ But no one can agree on what AGI means.

Why the most important term in tech remains hotly debated.

AI News - General · 18 min · about 2 hours ago

Llms

[P] I trained a language model from scratch for a low resource language and got it running fully on-device on Android (no GPU, demo)

Hi Everybody! I just wanted to share an update on a project I’ve been working on called BULaMU, a family of language models trained (20M,...

Reddit - Machine Learning · 1 min · about 4 hours ago

All Content

Ai Infrastructure

Tech stocks today: Nvidia CEO Jensen Huang suggests end of OpenAI investments, Apple unveils MacBook Neo

All eyes are on Nvidia's fourth quarter results, due after the closing bell on Wednesday, as AI concerns continue to grip markets.

AI News - General · 22 min · 26 days ago

Ai Infrastructure

Nvidia’s Jensen Huang Rules Out $100 Billion OpenAI Investment

submitted by /u/esporx [link] [comments]

Reddit - Artificial Intelligence · 1 min · 26 days ago

Machine Learning

Remote AI/ML Training Roles – Collaboration Opportunity (Up to $1K/Week)- USA only

Hi everyone 👋 If you’re interested in ML-adjacent remote work, there are currently AI training and evaluation roles open. These roles inv...

Reddit - ML Jobs · 1 min · 27 days ago

Llms

Open-source AI tool beats giant LLMs in literature reviews — and gets citations right

A new open-source AI model outperforms major large language models in literature reviews, achieving citation accuracy comparable to human...

AI News - General · 4 min · 27 days ago

Machine Learning

[2510.14894] Secure Sparse Matrix Multiplications and their Applications to Privacy-Preserving Machine Learning

Abstract page for arXiv paper 2510.14894: Secure Sparse Matrix Multiplications and their Applications to Privacy-Preserving Machine Learning

arXiv - Machine Learning · 4 min · 27 days ago

Llms

[2510.08946] Physically Valid Biomolecular Interaction Modeling with Gauss-Seidel Projection

Abstract page for arXiv paper 2510.08946: Physically Valid Biomolecular Interaction Modeling with Gauss-Seidel Projection

arXiv - Machine Learning · 4 min · 27 days ago

Llms

[2510.03215] Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Abstract page for arXiv paper 2510.03215: Cache-to-Cache: Direct Semantic Communication Between Large Language Models

arXiv - Machine Learning · 4 min · 27 days ago

Ai Infrastructure

[2506.08862] StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams

Abstract page for arXiv paper 2506.08862: StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams

arXiv - Machine Learning · 4 min · 27 days ago

Llms

[2505.05619] LiteLMGuard: Seamless and Lightweight On-Device Prompt Filtering for Safeguarding Small Language Models against Quantization-induced Risks and Vulnerabilities

Abstract page for arXiv paper 2505.05619: LiteLMGuard: Seamless and Lightweight On-Device Prompt Filtering for Safeguarding Small Languag...

arXiv - Machine Learning · 4 min · 27 days ago

Llms

[2404.02138] Topic-Based Watermarks for Large Language Models

Abstract page for arXiv paper 2404.02138: Topic-Based Watermarks for Large Language Models

arXiv - Machine Learning · 4 min · 27 days ago

Ai Infrastructure

[2602.06823] AI-Generated Music Detection in Broadcast Monitoring

Abstract page for arXiv paper 2602.06823: AI-Generated Music Detection in Broadcast Monitoring

arXiv - AI · 4 min · 27 days ago

Machine Learning

[2210.09709] Importance Weighting Correction of Regularized Least-Squares for Target Shift

Abstract page for arXiv paper 2210.09709: Importance Weighting Correction of Regularized Least-Squares for Target Shift

arXiv - Machine Learning · 4 min · 27 days ago

Llms

[2511.12832] From Passive to Persuasive: Steering Emotional Nuance in Human-AI Negotiation

Abstract page for arXiv paper 2511.12832: From Passive to Persuasive: Steering Emotional Nuance in Human-AI Negotiation

arXiv - AI · 3 min · 27 days ago

Llms

[2510.14686] xLLM Technical Report

Abstract page for arXiv paper 2510.14686: xLLM Technical Report

arXiv - AI · 4 min · 27 days ago

Llms

[2510.14086] Every Language Model Has a Forgery-Resistant Signature

Abstract page for arXiv paper 2510.14086: Every Language Model Has a Forgery-Resistant Signature

arXiv - AI · 4 min · 27 days ago

Llms

[2510.06084] Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability

Abstract page for arXiv paper 2510.06084: Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability

arXiv - AI · 4 min · 27 days ago

Machine Learning

[2602.00130] On the Relationship Between Representation Geometry and Generalization in Deep Neural Networks

Abstract page for arXiv paper 2602.00130: On the Relationship Between Representation Geometry and Generalization in Deep Neural Networks

arXiv - Machine Learning · 3 min · 27 days ago

$[2509.21091] Best-of-$\infty$ -- Asymptotic Performance of Test-Time Compute$

Llms

[2509.21091] Best-of-$\infty$ -- Asymptotic Performance of Test-Time Compute

Abstract page for arXiv paper 2509.21091: Best-of-$\infty$ -- Asymptotic Performance of Test-Time Compute

arXiv - AI · 3 min · 27 days ago

Llms

[2509.12610] ScaleDoc: Scaling LLM-based Predicates over Large Document Collections

Abstract page for arXiv paper 2509.12610: ScaleDoc: Scaling LLM-based Predicates over Large Document Collections

arXiv - Machine Learning · 4 min · 27 days ago

Ai Infrastructure

[2509.11663] ConEQsA: Concurrent and Asynchronous Embodied Questions Scheduling and Answering

Abstract page for arXiv paper 2509.11663: ConEQsA: Concurrent and Asynchronous Embodied Questions Scheduling and Answering

arXiv - AI · 4 min · 27 days ago

Previous Page 35 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

AI Infrastructure

Top This Week

[P] Run Karpathy's Autoresearch for $0.44 instead of $24 — Open-source parallel evolution pipeline on SageMaker Spot

Nvidia’s Jensen Huang says ‘We’ve achieved AGI.’ But no one can agree on what AGI means.

[P] I trained a language model from scratch for a low resource language and got it running fully on-device on Android (no GPU, demo)

All Content

Tech stocks today: Nvidia CEO Jensen Huang suggests end of OpenAI investments, Apple unveils MacBook Neo

Nvidia’s Jensen Huang Rules Out $100 Billion OpenAI Investment

Remote AI/ML Training Roles – Collaboration Opportunity (Up to $1K/Week)- USA only

Open-source AI tool beats giant LLMs in literature reviews — and gets citations right

[2510.14894] Secure Sparse Matrix Multiplications and their Applications to Privacy-Preserving Machine Learning

[2510.08946] Physically Valid Biomolecular Interaction Modeling with Gauss-Seidel Projection

[2510.03215] Cache-to-Cache: Direct Semantic Communication Between Large Language Models

[2506.08862] StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams

[2505.05619] LiteLMGuard: Seamless and Lightweight On-Device Prompt Filtering for Safeguarding Small Language Models against Quantization-induced Risks and Vulnerabilities

[2404.02138] Topic-Based Watermarks for Large Language Models

[2602.06823] AI-Generated Music Detection in Broadcast Monitoring

[2210.09709] Importance Weighting Correction of Regularized Least-Squares for Target Shift

[2511.12832] From Passive to Persuasive: Steering Emotional Nuance in Human-AI Negotiation

[2510.14686] xLLM Technical Report

[2510.14086] Every Language Model Has a Forgery-Resistant Signature

[2510.06084] Spectrum Tuning: Post-Training for Distributional Coverage and In-Context Steerability

[2602.00130] On the Relationship Between Representation Geometry and Generalization in Deep Neural Networks

[2509.21091] Best-of-$\infty$ -- Asymptotic Performance of Test-Time Compute

[2509.12610] ScaleDoc: Scaling LLM-based Predicates over Large Document Collections

[2509.11663] ConEQsA: Concurrent and Asynchronous Embodied Questions Scheduling and Answering

Related Topics

Stay updated with AI News