Machine Learning

ML algorithms, training, and inference


Top This Week

Machine Learning

Free tool I built to score dataset quality (LQS) — feedback welcome [D]

We built a Label Quality Score (LQS) system for our dataset marketplace and opened it up as a free standalone tool. Upload a dataset → ge...

Reddit - Machine Learning · 1 min · about 1 hour ago

Machine Learning

Meta’s New AI Model Gives Mark Zuckerberg a Seat at the Big Kid’s Table | WIRED

Muse Spark is Meta’s first model since its AI reboot, and the benchmarks suggest formidable performance.

Wired - AI · 6 min · about 1 hour ago

Machine Learning

Project Glasswing is inherently Cartel Behaviour

If the large companies always get access to the latest models first to "shore up cybersecurity" they will always have a head start on the ...

Reddit - Artificial Intelligence · 1 min · about 1 hour ago

All Content

Machine Learning

[2603.24904] On the Foundations of Trustworthy Artificial Intelligence

Abstract page for arXiv paper 2603.24904: On the Foundations of Trustworthy Artificial Intelligence

arXiv - AI · 3 min · 13 days ago

LLMs

[2603.24866] How Far Are Vision-Language Models from Constructing the Real World? A Benchmark for Physical Generative Reasoning

Abstract page for arXiv paper 2603.24866: How Far Are Vision-Language Models from Constructing the Real World? A Benchmark for Physical G...

arXiv - AI · 4 min · 13 days ago

Machine Learning

[2603.24853] Resisting Humanization: Ethical Front-End Design Choices in AI for Sensitive Contexts

Abstract page for arXiv paper 2603.24853: Resisting Humanization: Ethical Front-End Design Choices in AI for Sensitive Contexts

arXiv - AI · 4 min · 13 days ago

LLMs

[2603.24787] ReLope: KL-Regularized LoRA Probes for Multimodal LLM Routing

Abstract page for arXiv paper 2603.24787: ReLope: KL-Regularized LoRA Probes for Multimodal LLM Routing

arXiv - AI · 4 min · 13 days ago

LLMs

[2603.24768] Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regulation Agentic AI Loop for Engineering Design

Abstract page for arXiv paper 2603.24768: Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regulation Agentic AI Loop for Engineeri...

arXiv - AI · 4 min · 13 days ago

LLMs

[2603.24747] Formal Semantics for Agentic Tool Protocols: A Process Calculus Approach

Abstract page for arXiv paper 2603.24747: Formal Semantics for Agentic Tool Protocols: A Process Calculus Approach

arXiv - AI · 3 min · 13 days ago

Machine Learning

[2603.24742] Trust as Monitoring: Evolutionary Dynamics of User Trust and AI Developer Behaviour

Abstract page for arXiv paper 2603.24742: Trust as Monitoring: Evolutionary Dynamics of User Trust and AI Developer Behaviour

arXiv - Machine Learning · 4 min · 13 days ago

LLMs

[2603.24676] When Is Collective Intelligence a Lottery? Multi-Agent Scaling Laws for Memetic Drift in LLMs

Abstract page for arXiv paper 2603.24676: When Is Collective Intelligence a Lottery? Multi-Agent Scaling Laws for Memetic Drift in LLMs

arXiv - AI · 4 min · 13 days ago

Machine Learning

[2603.24621] ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence

Abstract page for arXiv paper 2603.24621: ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence

arXiv - AI · 3 min · 13 days ago

Machine Learning

CodexLib — compressed knowledge packs any AI can ingest instantly (100+ packs, 50 domains, REST API)

I built CodexLib (https://codexlib.io) — a curated repository of 100+ deep knowledge bases in compressed, AI-optimized format. The idea: ...

Reddit - Artificial Intelligence · 1 min · 13 days ago

LLMs

Claude's system prompt + XML tags is the most underused power combo right now

Most people just type into ChatGPT like it's Google. Claude with a structured system prompt using XML tags behaves like a completely diff...

Reddit - Artificial Intelligence · 1 min · 13 days ago

LLMs

Pretrained ADAM v2 weights [D]

Hi everyone, I'm a master's student working on anatomy-aware unsupervised anomaly detection in chest X-rays. My thesis uses ADAM v2 (Auto...

Reddit - Machine Learning · 1 min · 13 days ago

Machine Learning

[for hire] Open for contracts – Veteran Data Scientist (AI / ML / OR) focused on delivering real‑world solutions.

Expert Data Science Consultant | 20-Year Track Record. As a seasoned data scientist and fractional leader, I excel at tackling complex pro...

Reddit - ML Jobs · 1 min · 13 days ago

LLMs

[D] - 1M tokens/second serving Qwen 3.5 27B on B200 GPUs, benchmark results and findings

Wrote up the process of pushing Qwen 3.5 27B (dense, FP8) to 1.1M total tok/s on 96 B200 GPUs with vLLM v0.18.0. DP=8 nearly 4x'd through...

Reddit - Machine Learning · 1 min · 13 days ago

Machine Learning

[D] OOD and Spandrels, or What you should know about EBM.

Energy-based model. This article will compare EBMs to multi-layered perceptrons and address a lingering question: whether or not EBMs ...

Reddit - Machine Learning · 1 min · 13 days ago

LLMs

Reducing AI agent token consumption by 90% by fixing the retrieval layer

Quick insight from building retrieval infrastructure for AI agents: Most agents stuff 50,000 tokens of context into every prompt. They re...

Reddit - Artificial Intelligence · 1 min · 13 days ago

Machine Learning

ByteDance's new AI video generation model, Dreamina Seedance 2.0, comes to CapCut | TechCrunch

The new model in CapCut will have built-in protections for making video from real faces or unauthorized intellectual property.

TechCrunch - AI · 4 min · 13 days ago

Machine Learning

Conntour raises $7M from General Catalyst, YC to build an AI search engine for security video systems | TechCrunch

Conntour uses AI models to let security teams query camera feeds using natural language to find any object, person, or situation.

TechCrunch - AI · 6 min · 13 days ago

Machine Learning

Cohere launches an open-source voice model specifically for transcription | TechCrunch

Relatively light at just 2 billion parameters, the model is meant for use with consumer-grade GPUs for those who want to self-host it. It...

TechCrunch - AI · 4 min · 13 days ago

Machine Learning

Cheaper & Faster & Smarter (TurboQuant and Attention Residuals)

Google TurboQuant: This is a new compression algorithm. Every time a model answers a question, it stores a massive amount of intermediate ...

Reddit - Artificial Intelligence · 1 min · 13 days ago

