Machine Learning

ML algorithms, training, and inference

Top This Week

Machine Learning

Researchers Discover Bias in AI Models That Analyze Pathology Samples

AI News - General ·
UMKC Announces New Master of Science in Artificial Intelligence
Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Role of Artificial Intelligence and Machine Learning in Diagnosing Knee Lesions: Where Are We Now?
Machine Learning

Role of Artificial Intelligence and Machine Learning in Diagnosing Knee Lesions: Where Are We Now?

AI News - General · 2 min ·

All Content

Llms

[D] - 1M tokens/second serving Qwen 3.5 27B on B200 GPUs, benchmark results and findings

Wrote up the process of pushing Qwen 3.5 27B (dense, FP8) to 1.1M total tok/s on 96 B200 GPUs with vLLM v0.18.0. DP=8 nearly 4x'd through...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] OOD and Spandrels, or What you should know about EBM.

Energy-based model This article will compare EBMs to multi-layered perceptrons, and addresses a lingering question : Whether or not EBMs ...

Reddit - Machine Learning · 1 min ·
Llms

Reducing AI agent token consumption by 90% by fixing the retrieval layer

Quick insight from building retrieval infrastructure for AI agents: Most agents stuff 50,000 tokens of context into every prompt. They re...

Reddit - Artificial Intelligence · 1 min ·
ByteDance's new AI video generation model, Dreamina Seedance 2.0, comes to CapCut | TechCrunch
Machine Learning

ByteDance's new AI video generation model, Dreamina Seedance 2.0, comes to CapCut | TechCrunch

The new model in CapCut will have built-in protections for making video from real faces or unauthorized intellectual property.

TechCrunch - AI · 4 min ·
Conntour raises $7M from General Catalyst, YC to build an AI search engine for security video systems | TechCrunch
Machine Learning

Conntour raises $7M from General Catalyst, YC to build an AI search engine for security video systems | TechCrunch

Conntour uses AI models to let security teams query camera feeds using natural language to find any object, person, or situation.

TechCrunch - AI · 6 min ·
Cohere launches an open-source voice model specifically for transcription | TechCrunch
Machine Learning

Cohere launches an open-source voice model specifically for transcription | TechCrunch

Relatively light at just 2 billion parameters, the model is meant for use with consumer-grade GPUs for those who want to self-host it. It...

TechCrunch - AI · 4 min ·
Machine Learning

Cheaper & Faster & Smarter (TurboQuant and Attention Residuals)

Google TurboQuant This is a new compression algorithm. Every time a model answers a question, it stores a massive amount of intermediate ...

Reddit - Artificial Intelligence · 1 min ·
Mistral releases a new open-source model for speech generation | TechCrunch
Llms

Mistral releases a new open-source model for speech generation | TechCrunch

Mistral's new speech model can run on a smartwatch or a smartphone.

TechCrunch - AI · 4 min ·
The snow gods: How a couple of ski bums built the internet’s best weather app | MIT Technology Review
Machine Learning

The snow gods: How a couple of ski bums built the internet’s best weather app | MIT Technology Review

The best snow-forecasting app for skiers and snowboarders isn’t from any of the federally funded weather services. Nor from any of the bi...

MIT Technology Review · 20 min ·
Machine Learning

AI is biometric and most of the laborers learning models today are built off my biometric signature let’s chat

## THE ARCHITECT’S STORY: FROM THE 1985 ROOT TO THE "AI WASH" To those who believe in the truth of a human life, I am writing to you not ...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

I built a real-time pipeline that reads game subtitles and converts them into dynamic voice acting (OCR → TTS → RVC) [P]

I've been experimenting with real-time pipelines that combine OCR + TTS + voice conversion, and I ended up building a desktop app that ca...

Reddit - Machine Learning · 1 min ·
Clutch Names Excellent Webworld a Top Performer in AI, ML, App, S
Machine Learning

Clutch Names Excellent Webworld a Top Performer in AI, ML, App, S

Recognized across 7 categories by Clutch, Excellent Webworld reinforces its position as a trusted AI and software partner delivering cons...

AI Tools & Products · 8 min ·
[2603.18865] RadioDiff-FS: Physics-Informed Manifold Alignment in Few-Shot Diffusion Models for High-Fidelity Radio Map Construction
Machine Learning

[2603.18865] RadioDiff-FS: Physics-Informed Manifold Alignment in Few-Shot Diffusion Models for High-Fidelity Radio Map Construction

Abstract page for arXiv paper 2603.18865: RadioDiff-FS: Physics-Informed Manifold Alignment in Few-Shot Diffusion Models for High-Fidelit...

arXiv - Machine Learning · 4 min ·
[2603.18853] Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments
Machine Learning

[2603.18853] Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments

Abstract page for arXiv paper 2603.18853: Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments

arXiv - Machine Learning · 4 min ·
[2603.14831] Neural Networks as Local-to-Global Computations
Machine Learning

[2603.14831] Neural Networks as Local-to-Global Computations

Abstract page for arXiv paper 2603.14831: Neural Networks as Local-to-Global Computations

arXiv - Machine Learning · 4 min ·
[2603.11804] OSMDA: OpenStreetMap-based Domain Adaptation for Remote Sensing VLMs
Llms

[2603.11804] OSMDA: OpenStreetMap-based Domain Adaptation for Remote Sensing VLMs

Abstract page for arXiv paper 2603.11804: OSMDA: OpenStreetMap-based Domain Adaptation for Remote Sensing VLMs

arXiv - Machine Learning · 4 min ·
[2602.07058] SPARE: Self-distillation for PARameter-Efficient Removal
Machine Learning

[2602.07058] SPARE: Self-distillation for PARameter-Efficient Removal

Abstract page for arXiv paper 2602.07058: SPARE: Self-distillation for PARameter-Efficient Removal

arXiv - Machine Learning · 4 min ·
[2602.00381] Modeling Image-Caption Rating from Comparative Judgments
Machine Learning

[2602.00381] Modeling Image-Caption Rating from Comparative Judgments

Abstract page for arXiv paper 2602.00381: Modeling Image-Caption Rating from Comparative Judgments

arXiv - Machine Learning · 4 min ·
[2512.23138] Why Machine Learning Models Systematically Underestimate Extreme Values II: How to Fix It with LatentNN
Machine Learning

[2512.23138] Why Machine Learning Models Systematically Underestimate Extreme Values II: How to Fix It with LatentNN

Abstract page for arXiv paper 2512.23138: Why Machine Learning Models Systematically Underestimate Extreme Values II: How to Fix It with ...

arXiv - Machine Learning · 4 min ·
[2512.16917] Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning
Llms

[2512.16917] Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning

Abstract page for arXiv paper 2512.16917: Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning

arXiv - Machine Learning · 4 min ·
Previous Page 183 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime