Machine Learning

ML algorithms, training, and inference

Top This Week

[2602.10370] Causal Effect Estimation with Learned Instrument Representations
Machine Learning

[2602.10370] Causal Effect Estimation with Learned Instrument Representations

Abstract page for arXiv paper 2602.10370: Causal Effect Estimation with Learned Instrument Representations

arXiv - Machine Learning · 4 min ·
[2602.04728] Scalable Cross-Attention Transformer for Cooperative Multi-AP OFDM Uplink Reception
Machine Learning

[2602.04728] Scalable Cross-Attention Transformer for Cooperative Multi-AP OFDM Uplink Reception

Abstract page for arXiv paper 2602.04728: Scalable Cross-Attention Transformer for Cooperative Multi-AP OFDM Uplink Reception

arXiv - Machine Learning · 3 min ·
[2602.00913] Do Schwartz Higher-Order Values Help Sentence-Level Human Value Detection? A Study of Hierarchical Gating and Calibration
Machine Learning

[2602.00913] Do Schwartz Higher-Order Values Help Sentence-Level Human Value Detection? A Study of Hierarchical Gating and Calibration

Abstract page for arXiv paper 2602.00913: Do Schwartz Higher-Order Values Help Sentence-Level Human Value Detection? A Study of Hierarchi...

arXiv - Machine Learning · 4 min ·

All Content

[2603.24768] Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regulation Agentic AI Loop for Engineering Design
Llms

[2603.24768] Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regulation Agentic AI Loop for Engineering Design

Abstract page for arXiv paper 2603.24768: Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regulation Agentic AI Loop for Engineeri...

arXiv - AI · 4 min ·
[2603.24747] Formal Semantics for Agentic Tool Protocols: A Process Calculus Approach
Llms

[2603.24747] Formal Semantics for Agentic Tool Protocols: A Process Calculus Approach

Abstract page for arXiv paper 2603.24747: Formal Semantics for Agentic Tool Protocols: A Process Calculus Approach

arXiv - AI · 3 min ·
[2603.24742] Trust as Monitoring: Evolutionary Dynamics of User Trust and AI Developer Behaviour
Machine Learning

[2603.24742] Trust as Monitoring: Evolutionary Dynamics of User Trust and AI Developer Behaviour

Abstract page for arXiv paper 2603.24742: Trust as Monitoring: Evolutionary Dynamics of User Trust and AI Developer Behaviour

arXiv - Machine Learning · 4 min ·
[2603.24676] When Is Collective Intelligence a Lottery? Multi-Agent Scaling Laws for Memetic Drift in LLMs
Llms

[2603.24676] When Is Collective Intelligence a Lottery? Multi-Agent Scaling Laws for Memetic Drift in LLMs

Abstract page for arXiv paper 2603.24676: When Is Collective Intelligence a Lottery? Multi-Agent Scaling Laws for Memetic Drift in LLMs

arXiv - AI · 4 min ·
[2603.24621] ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence
Machine Learning

[2603.24621] ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence

Abstract page for arXiv paper 2603.24621: ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence

arXiv - AI · 3 min ·
Machine Learning

CodexLib — compressed knowledge packs any AI can ingest instantly (100+ packs, 50 domains, REST API)

I built CodexLib (https://codexlib.io) — a curated repository of 100+ deep knowledge bases in compressed, AI-optimized format. The idea: ...

Reddit - Artificial Intelligence · 1 min ·
Llms

Claude's system prompt + XML tags is the most underused power combo right now

Most people just type into ChatGPT like it's Google. Claude with a structured system prompt using XML tags behaves like a completely diff...

Reddit - Artificial Intelligence · 1 min ·
Llms

Pretrained ADAM v2 weights [D]

Hi everyone, I'm a master's student working on anatomy-aware unsupervised anomaly detection in chest X-rays. My thesis uses ADAM v2 (Auto...

Reddit - Machine Learning · 1 min ·
Machine Learning

[for hire] Open for contracts – Veteran Data Scientist (AI / ML / OR) focused on delivering real‑world solutions.

Expert Data Science Consultant | 20-Year Track Record As a seasoned data scientist and fractional leader, I excel at tackling complex pro...

Reddit - ML Jobs · 1 min ·
Llms

[D] - 1M tokens/second serving Qwen 3.5 27B on B200 GPUs, benchmark results and findings

Wrote up the process of pushing Qwen 3.5 27B (dense, FP8) to 1.1M total tok/s on 96 B200 GPUs with vLLM v0.18.0. DP=8 nearly 4x'd through...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] OOD and Spandrels, or What you should know about EBM.

Energy-based model This article will compare EBMs to multi-layered perceptrons, and addresses a lingering question : Whether or not EBMs ...

Reddit - Machine Learning · 1 min ·
Llms

Reducing AI agent token consumption by 90% by fixing the retrieval layer

Quick insight from building retrieval infrastructure for AI agents: Most agents stuff 50,000 tokens of context into every prompt. They re...

Reddit - Artificial Intelligence · 1 min ·
ByteDance's new AI video generation model, Dreamina Seedance 2.0, comes to CapCut | TechCrunch
Machine Learning

ByteDance's new AI video generation model, Dreamina Seedance 2.0, comes to CapCut | TechCrunch

The new model in CapCut will have built-in protections for making video from real faces or unauthorized intellectual property.

TechCrunch - AI · 4 min ·
Conntour raises $7M from General Catalyst, YC to build an AI search engine for security video systems | TechCrunch
Machine Learning

Conntour raises $7M from General Catalyst, YC to build an AI search engine for security video systems | TechCrunch

Conntour uses AI models to let security teams query camera feeds using natural language to find any object, person, or situation.

TechCrunch - AI · 6 min ·
Cohere launches an open-source voice model specifically for transcription | TechCrunch
Machine Learning

Cohere launches an open-source voice model specifically for transcription | TechCrunch

Relatively light at just 2 billion parameters, the model is meant for use with consumer-grade GPUs for those who want to self-host it. It...

TechCrunch - AI · 4 min ·
Machine Learning

Cheaper & Faster & Smarter (TurboQuant and Attention Residuals)

Google TurboQuant This is a new compression algorithm. Every time a model answers a question, it stores a massive amount of intermediate ...

Reddit - Artificial Intelligence · 1 min ·
Mistral releases a new open-source model for speech generation | TechCrunch
Llms

Mistral releases a new open-source model for speech generation | TechCrunch

Mistral's new speech model can run on a smartwatch or a smartphone.

TechCrunch - AI · 4 min ·
The snow gods: How a couple of ski bums built the internet’s best weather app | MIT Technology Review
Machine Learning

The snow gods: How a couple of ski bums built the internet’s best weather app | MIT Technology Review

The best snow-forecasting app for skiers and snowboarders isn’t from any of the federally funded weather services. Nor from any of the bi...

MIT Technology Review · 20 min ·
Machine Learning

AI is biometric and most of the laborers learning models today are built off my biometric signature let’s chat

## THE ARCHITECT’S STORY: FROM THE 1985 ROOT TO THE "AI WASH" To those who believe in the truth of a human life, I am writing to you not ...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

I built a real-time pipeline that reads game subtitles and converts them into dynamic voice acting (OCR → TTS → RVC) [P]

I've been experimenting with real-time pipelines that combine OCR + TTS + voice conversion, and I ended up building a desktop app that ca...

Reddit - Machine Learning · 1 min ·
Previous Page 143 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime