Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

Finally Abliterated Sarvam 30B and 105B!

I abliterated Sarvam-30B and 105B - India's first multilingual MoE reasoning models - and found something interesting along the way! Reas...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Machine Learning

BANKING77-77: New best of 94.61% on the official test set (+0.13pp) over our previous tests 94.48%.

Hi everyone, Just wanted to share a small but hard-won milestone. After a long plateau at 94.48%, we’ve pushed the official BANKING77-77 ...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

Machine Learning

Free tool I built to score dataset quality (LQS) — feedback welcome [D]

We built a Label Quality Score (LQS) system for our dataset marketplace and opened it up as a free standalone tool. Upload a dataset → ge...

Reddit - Machine Learning · 1 min · about 3 hours ago

All Content

Cohere launches an open-source voice model specifically for transcription | TechCrunch

Machine Learning

Cohere launches an open-source voice model specifically for transcription | TechCrunch

Relatively light at just 2 billion parameters, the model is meant for use with consumer-grade GPUs for those who want to self-host it. It...

TechCrunch - AI · 4 min · 13 days ago

Machine Learning

Cheaper & Faster & Smarter (TurboQuant and Attention Residuals)

Google TurboQuant This is a new compression algorithm. Every time a model answers a question, it stores a massive amount of intermediate ...

Reddit - Artificial Intelligence · 1 min · 13 days ago

Mistral releases a new open-source model for speech generation | TechCrunch

Llms

Mistral releases a new open-source model for speech generation | TechCrunch

Mistral's new speech model can run on a smartwatch or a smartphone.

TechCrunch - AI · 4 min · 13 days ago

The snow gods: How a couple of ski bums built the internet’s best weather app | MIT Technology Review

Machine Learning

The snow gods: How a couple of ski bums built the internet’s best weather app | MIT Technology Review

The best snow-forecasting app for skiers and snowboarders isn’t from any of the federally funded weather services. Nor from any of the bi...

MIT Technology Review · 20 min · 14 days ago

Machine Learning

AI is biometric and most of the laborers learning models today are built off my biometric signature let’s chat

## THE ARCHITECT’S STORY: FROM THE 1985 ROOT TO THE "AI WASH" To those who believe in the truth of a human life, I am writing to you not ...

Reddit - Artificial Intelligence · 1 min · 14 days ago

Machine Learning

I built a real-time pipeline that reads game subtitles and converts them into dynamic voice acting (OCR → TTS → RVC) [P]

I've been experimenting with real-time pipelines that combine OCR + TTS + voice conversion, and I ended up building a desktop app that ca...

Reddit - Machine Learning · 1 min · 14 days ago

Clutch Names Excellent Webworld a Top Performer in AI, ML, App, S

Machine Learning

Clutch Names Excellent Webworld a Top Performer in AI, ML, App, S

Recognized across 7 categories by Clutch, Excellent Webworld reinforces its position as a trusted AI and software partner delivering cons...

AI Tools & Products · 8 min · 14 days ago

Machine Learning

[2603.18865] RadioDiff-FS: Physics-Informed Manifold Alignment in Few-Shot Diffusion Models for High-Fidelity Radio Map Construction

Abstract page for arXiv paper 2603.18865: RadioDiff-FS: Physics-Informed Manifold Alignment in Few-Shot Diffusion Models for High-Fidelit...

arXiv - Machine Learning · 4 min · 14 days ago

Machine Learning

[2603.18853] Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments

Abstract page for arXiv paper 2603.18853: Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments

arXiv - Machine Learning · 4 min · 14 days ago

Machine Learning

[2603.14831] Neural Networks as Local-to-Global Computations

Abstract page for arXiv paper 2603.14831: Neural Networks as Local-to-Global Computations

arXiv - Machine Learning · 4 min · 14 days ago

Llms

[2603.11804] OSMDA: OpenStreetMap-based Domain Adaptation for Remote Sensing VLMs

Abstract page for arXiv paper 2603.11804: OSMDA: OpenStreetMap-based Domain Adaptation for Remote Sensing VLMs

arXiv - Machine Learning · 4 min · 14 days ago

Machine Learning

[2602.07058] SPARE: Self-distillation for PARameter-Efficient Removal

Abstract page for arXiv paper 2602.07058: SPARE: Self-distillation for PARameter-Efficient Removal

arXiv - Machine Learning · 4 min · 14 days ago

Machine Learning

[2602.00381] Modeling Image-Caption Rating from Comparative Judgments

Abstract page for arXiv paper 2602.00381: Modeling Image-Caption Rating from Comparative Judgments

arXiv - Machine Learning · 4 min · 14 days ago

Machine Learning

[2512.23138] Why Machine Learning Models Systematically Underestimate Extreme Values II: How to Fix It with LatentNN

Abstract page for arXiv paper 2512.23138: Why Machine Learning Models Systematically Underestimate Extreme Values II: How to Fix It with ...

arXiv - Machine Learning · 4 min · 14 days ago

Llms

[2512.16917] Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning

Abstract page for arXiv paper 2512.16917: Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning

arXiv - Machine Learning · 4 min · 14 days ago

Machine Learning

[2512.04000] Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding

Abstract page for arXiv paper 2512.04000: Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding

arXiv - Machine Learning · 4 min · 14 days ago

Machine Learning

[2511.21542] E0: Enhancing Generalization and Fine-Grained Control in VLA Models via Tweedie Discrete Diffusion

Abstract page for arXiv paper 2511.21542: E0: Enhancing Generalization and Fine-Grained Control in VLA Models via Tweedie Discrete Diffusion

arXiv - Machine Learning · 4 min · 14 days ago

Machine Learning

[2511.20888] Deep Learning as a Convex Paradigm of Computation: Minimizing Circuit Size with ResNets

Abstract page for arXiv paper 2511.20888: Deep Learning as a Convex Paradigm of Computation: Minimizing Circuit Size with ResNets

arXiv - Machine Learning · 4 min · 14 days ago

Llms

[2510.12728] Data-Prompt Co-Evolution: Growing Test Sets to Refine LLM Behavior

Abstract page for arXiv paper 2510.12728: Data-Prompt Co-Evolution: Growing Test Sets to Refine LLM Behavior

arXiv - Machine Learning · 4 min · 14 days ago

Llms

[2510.10223] You only need 4 extra tokens: Synergistic Test-time Adaptation for LLMs

Abstract page for arXiv paper 2510.10223: You only need 4 extra tokens: Synergistic Test-time Adaptation for LLMs

arXiv - Machine Learning · 4 min · 14 days ago

Previous Page 153 Next

Related Topics

Large Language Models Generative AI Natural Language Processing Computer Vision Robotics & Embodied AI AI Safety & Ethics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime