Machine Learning

ML algorithms, training, and inference

Top This Week

Llms

TurboOCR: 270–1200 img/s OCR with Paddle + TensorRT (C++/CUDA, FP16) [P]

I had about 940,000 PDFs to process. Running VLMs over a million pages is slow and expensive, and that gap is only getting worse as OCR m...

Reddit - Machine Learning · 1 min ·
Machine Learning

Trained a Qwen2.5-0.5B-Instruct bf16 model on Reddit post summarization task with GRPO [P]

So, a few days back I shared a post where I trained a tiny Qwen2.5-0.5B-Instruct model on smoltldr (reddit post summarization dataset of ...

Reddit - Machine Learning · 1 min ·
Mark Zuckerberg is reportedly building an AI clone to replace him in meetings | The Verge
Machine Learning

Mark Zuckerberg is reportedly building an AI clone to replace him in meetings | The Verge

Meta is working to build an AI version of its CEO Mark Zuckerberg, which he will use to interact with employees, according to a report fr...

The Verge - AI · 4 min ·

All Content

Llms

Reducing AI agent token consumption by 90% by fixing the retrieval layer

Quick insight from building retrieval infrastructure for AI agents: Most agents stuff 50,000 tokens of context into every prompt. They re...

Reddit - Artificial Intelligence · 1 min ·
ByteDance's new AI video generation model, Dreamina Seedance 2.0, comes to CapCut | TechCrunch
Machine Learning

ByteDance's new AI video generation model, Dreamina Seedance 2.0, comes to CapCut | TechCrunch

The new model in CapCut will have built-in protections for making video from real faces or unauthorized intellectual property.

TechCrunch - AI · 4 min ·
Conntour raises $7M from General Catalyst, YC to build an AI search engine for security video systems | TechCrunch
Machine Learning

Conntour raises $7M from General Catalyst, YC to build an AI search engine for security video systems | TechCrunch

Conntour uses AI models to let security teams query camera feeds using natural language to find any object, person, or situation.

TechCrunch - AI · 6 min ·
Cohere launches an open-source voice model specifically for transcription | TechCrunch
Machine Learning

Cohere launches an open-source voice model specifically for transcription | TechCrunch

Relatively light at just 2 billion parameters, the model is meant for use with consumer-grade GPUs for those who want to self-host it. It...

TechCrunch - AI · 4 min ·
Machine Learning

Cheaper & Faster & Smarter (TurboQuant and Attention Residuals)

Google TurboQuant This is a new compression algorithm. Every time a model answers a question, it stores a massive amount of intermediate ...

Reddit - Artificial Intelligence · 1 min ·
Mistral releases a new open-source model for speech generation | TechCrunch
Llms

Mistral releases a new open-source model for speech generation | TechCrunch

Mistral's new speech model can run on a smartwatch or a smartphone.

TechCrunch - AI · 4 min ·
The snow gods: How a couple of ski bums built the internet’s best weather app | MIT Technology Review
Machine Learning

The snow gods: How a couple of ski bums built the internet’s best weather app | MIT Technology Review

The best snow-forecasting app for skiers and snowboarders isn’t from any of the federally funded weather services. Nor from any of the bi...

MIT Technology Review · 20 min ·
Machine Learning

AI is biometric and most of the laborers learning models today are built off my biometric signature let’s chat

## THE ARCHITECT’S STORY: FROM THE 1985 ROOT TO THE "AI WASH" To those who believe in the truth of a human life, I am writing to you not ...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

I built a real-time pipeline that reads game subtitles and converts them into dynamic voice acting (OCR → TTS → RVC) [P]

I've been experimenting with real-time pipelines that combine OCR + TTS + voice conversion, and I ended up building a desktop app that ca...

Reddit - Machine Learning · 1 min ·
Clutch Names Excellent Webworld a Top Performer in AI, ML, App, S
Machine Learning

Clutch Names Excellent Webworld a Top Performer in AI, ML, App, S

Recognized across 7 categories by Clutch, Excellent Webworld reinforces its position as a trusted AI and software partner delivering cons...

AI Tools & Products · 8 min ·
[2603.18865] RadioDiff-FS: Physics-Informed Manifold Alignment in Few-Shot Diffusion Models for High-Fidelity Radio Map Construction
Machine Learning

[2603.18865] RadioDiff-FS: Physics-Informed Manifold Alignment in Few-Shot Diffusion Models for High-Fidelity Radio Map Construction

Abstract page for arXiv paper 2603.18865: RadioDiff-FS: Physics-Informed Manifold Alignment in Few-Shot Diffusion Models for High-Fidelit...

arXiv - Machine Learning · 4 min ·
[2603.18853] Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments
Machine Learning

[2603.18853] Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments

Abstract page for arXiv paper 2603.18853: Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments

arXiv - Machine Learning · 4 min ·
[2603.14831] Neural Networks as Local-to-Global Computations
Machine Learning

[2603.14831] Neural Networks as Local-to-Global Computations

Abstract page for arXiv paper 2603.14831: Neural Networks as Local-to-Global Computations

arXiv - Machine Learning · 4 min ·
[2603.11804] OSMDA: OpenStreetMap-based Domain Adaptation for Remote Sensing VLMs
Llms

[2603.11804] OSMDA: OpenStreetMap-based Domain Adaptation for Remote Sensing VLMs

Abstract page for arXiv paper 2603.11804: OSMDA: OpenStreetMap-based Domain Adaptation for Remote Sensing VLMs

arXiv - Machine Learning · 4 min ·
[2602.07058] SPARE: Self-distillation for PARameter-Efficient Removal
Machine Learning

[2602.07058] SPARE: Self-distillation for PARameter-Efficient Removal

Abstract page for arXiv paper 2602.07058: SPARE: Self-distillation for PARameter-Efficient Removal

arXiv - Machine Learning · 4 min ·
[2602.00381] Modeling Image-Caption Rating from Comparative Judgments
Machine Learning

[2602.00381] Modeling Image-Caption Rating from Comparative Judgments

Abstract page for arXiv paper 2602.00381: Modeling Image-Caption Rating from Comparative Judgments

arXiv - Machine Learning · 4 min ·
[2512.23138] Why Machine Learning Models Systematically Underestimate Extreme Values II: How to Fix It with LatentNN
Machine Learning

[2512.23138] Why Machine Learning Models Systematically Underestimate Extreme Values II: How to Fix It with LatentNN

Abstract page for arXiv paper 2512.23138: Why Machine Learning Models Systematically Underestimate Extreme Values II: How to Fix It with ...

arXiv - Machine Learning · 4 min ·
[2512.16917] Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning
Llms

[2512.16917] Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning

Abstract page for arXiv paper 2512.16917: Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning

arXiv - Machine Learning · 4 min ·
[2512.04000] Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding
Machine Learning

[2512.04000] Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding

Abstract page for arXiv paper 2512.04000: Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding

arXiv - Machine Learning · 4 min ·
[2511.21542] E0: Enhancing Generalization and Fine-Grained Control in VLA Models via Tweedie Discrete Diffusion
Machine Learning

[2511.21542] E0: Enhancing Generalization and Fine-Grained Control in VLA Models via Tweedie Discrete Diffusion

Abstract page for arXiv paper 2511.21542: E0: Enhancing Generalization and Fine-Grained Control in VLA Models via Tweedie Discrete Diffusion

arXiv - Machine Learning · 4 min ·
Previous Page 189 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime