Machine Learning

ML algorithms, training, and inference

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Improving AI models’ ability to explain their predictions
Machine Learning

Improving AI models’ ability to explain their predictions

AI News - General · 9 min ·
AI Hiring Growth: AI and ML Hiring Surges 37% in Marche
Machine Learning

AI Hiring Growth: AI and ML Hiring Surges 37% in Marche

AI News - General · 1 min ·

All Content

Llms

[D] - 1M tokens/second serving Qwen 3.5 27B on B200 GPUs, benchmark results and findings

Wrote up the process of pushing Qwen 3.5 27B (dense, FP8) to 1.1M total tok/s on 96 B200 GPUs with vLLM v0.18.0. DP=8 nearly 4x'd through...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] OOD and Spandrels, or What you should know about EBM.

Energy-based model This article will compare EBMs to multi-layered perceptrons, and addresses a lingering question : Whether or not EBMs ...

Reddit - Machine Learning · 1 min ·
Llms

Reducing AI agent token consumption by 90% by fixing the retrieval layer

Quick insight from building retrieval infrastructure for AI agents: Most agents stuff 50,000 tokens of context into every prompt. They re...

Reddit - Artificial Intelligence · 1 min ·
ByteDance's new AI video generation model, Dreamina Seedance 2.0, comes to CapCut | TechCrunch
Machine Learning

ByteDance's new AI video generation model, Dreamina Seedance 2.0, comes to CapCut | TechCrunch

The new model in CapCut will have built-in protections for making video from real faces or unauthorized intellectual property.

TechCrunch - AI · 4 min ·
Conntour raises $7M from General Catalyst, YC to build an AI search engine for security video systems | TechCrunch
Machine Learning

Conntour raises $7M from General Catalyst, YC to build an AI search engine for security video systems | TechCrunch

Conntour uses AI models to let security teams query camera feeds using natural language to find any object, person, or situation.

TechCrunch - AI · 6 min ·
Cohere launches an open-source voice model specifically for transcription | TechCrunch
Machine Learning

Cohere launches an open-source voice model specifically for transcription | TechCrunch

Relatively light at just 2 billion parameters, the model is meant for use with consumer-grade GPUs for those who want to self-host it. It...

TechCrunch - AI · 4 min ·
Machine Learning

Cheaper & Faster & Smarter (TurboQuant and Attention Residuals)

Google TurboQuant This is a new compression algorithm. Every time a model answers a question, it stores a massive amount of intermediate ...

Reddit - Artificial Intelligence · 1 min ·
Mistral releases a new open-source model for speech generation | TechCrunch
Llms

Mistral releases a new open-source model for speech generation | TechCrunch

Mistral's new speech model can run on a smartwatch or a smartphone.

TechCrunch - AI · 4 min ·
The snow gods: How a couple of ski bums built the internet’s best weather app | MIT Technology Review
Machine Learning

The snow gods: How a couple of ski bums built the internet’s best weather app | MIT Technology Review

The best snow-forecasting app for skiers and snowboarders isn’t from any of the federally funded weather services. Nor from any of the bi...

MIT Technology Review · 20 min ·
Machine Learning

AI is biometric and most of the laborers learning models today are built off my biometric signature let’s chat

## THE ARCHITECT’S STORY: FROM THE 1985 ROOT TO THE "AI WASH" To those who believe in the truth of a human life, I am writing to you not ...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

I built a real-time pipeline that reads game subtitles and converts them into dynamic voice acting (OCR → TTS → RVC) [P]

I've been experimenting with real-time pipelines that combine OCR + TTS + voice conversion, and I ended up building a desktop app that ca...

Reddit - Machine Learning · 1 min ·
Clutch Names Excellent Webworld a Top Performer in AI, ML, App, S
Machine Learning

Clutch Names Excellent Webworld a Top Performer in AI, ML, App, S

Recognized across 7 categories by Clutch, Excellent Webworld reinforces its position as a trusted AI and software partner delivering cons...

AI Tools & Products · 8 min ·
[2603.18865] RadioDiff-FS: Physics-Informed Manifold Alignment in Few-Shot Diffusion Models for High-Fidelity Radio Map Construction
Machine Learning

[2603.18865] RadioDiff-FS: Physics-Informed Manifold Alignment in Few-Shot Diffusion Models for High-Fidelity Radio Map Construction

Abstract page for arXiv paper 2603.18865: RadioDiff-FS: Physics-Informed Manifold Alignment in Few-Shot Diffusion Models for High-Fidelit...

arXiv - Machine Learning · 4 min ·
[2603.18853] Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments
Machine Learning

[2603.18853] Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments

Abstract page for arXiv paper 2603.18853: Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments

arXiv - Machine Learning · 4 min ·
[2603.14831] Neural Networks as Local-to-Global Computations
Machine Learning

[2603.14831] Neural Networks as Local-to-Global Computations

Abstract page for arXiv paper 2603.14831: Neural Networks as Local-to-Global Computations

arXiv - Machine Learning · 4 min ·
[2603.11804] OSMDA: OpenStreetMap-based Domain Adaptation for Remote Sensing VLMs
Llms

[2603.11804] OSMDA: OpenStreetMap-based Domain Adaptation for Remote Sensing VLMs

Abstract page for arXiv paper 2603.11804: OSMDA: OpenStreetMap-based Domain Adaptation for Remote Sensing VLMs

arXiv - Machine Learning · 4 min ·
[2602.07058] SPARE: Self-distillation for PARameter-Efficient Removal
Machine Learning

[2602.07058] SPARE: Self-distillation for PARameter-Efficient Removal

Abstract page for arXiv paper 2602.07058: SPARE: Self-distillation for PARameter-Efficient Removal

arXiv - Machine Learning · 4 min ·
[2602.00381] Modeling Image-Caption Rating from Comparative Judgments
Machine Learning

[2602.00381] Modeling Image-Caption Rating from Comparative Judgments

Abstract page for arXiv paper 2602.00381: Modeling Image-Caption Rating from Comparative Judgments

arXiv - Machine Learning · 4 min ·
[2512.23138] Why Machine Learning Models Systematically Underestimate Extreme Values II: How to Fix It with LatentNN
Machine Learning

[2512.23138] Why Machine Learning Models Systematically Underestimate Extreme Values II: How to Fix It with LatentNN

Abstract page for arXiv paper 2512.23138: Why Machine Learning Models Systematically Underestimate Extreme Values II: How to Fix It with ...

arXiv - Machine Learning · 4 min ·
[2512.16917] Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning
Llms

[2512.16917] Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning

Abstract page for arXiv paper 2512.16917: Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning

arXiv - Machine Learning · 4 min ·
Previous Page 134 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime