Machine Learning

ML algorithms, training, and inference

Top This Week

Llms

World models will be the next big thing, bye-bye LLMs

Was at Nvidia's GTC conference recently and honestly, it was one of the most eye-opening events I've attended in a while. There was a lot...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

[D] Got my first offer after months of searching — below posted range, contract-to-hire, and worried it may pause my search. Do I take it?

I could really use some outside perspective. I’m a senior ML/CV engineer in Canada with about 5–6 years across research and industry. Mas...

Reddit - Machine Learning · 1 min ·
Machine Learning

[Research] AI training is bad, so I started an research

Hello, I started researching about AI training Q:Why? R: Because AI training is bad right now. Q: What do you mean its bad? R: Like when ...

Reddit - Machine Learning · 1 min ·

All Content

[2603.24961] Can MLLMs Read Students' Minds? Unpacking Multimodal Error Analysis in Handwritten Math
Llms

[2603.24961] Can MLLMs Read Students' Minds? Unpacking Multimodal Error Analysis in Handwritten Math

Abstract page for arXiv paper 2603.24961: Can MLLMs Read Students' Minds? Unpacking Multimodal Error Analysis in Handwritten Math

arXiv - AI · 4 min ·
[2603.24943] FinMCP-Bench: Benchmarking LLM Agents for Real-World Financial Tool Use under the Model Context Protocol
Llms

[2603.24943] FinMCP-Bench: Benchmarking LLM Agents for Real-World Financial Tool Use under the Model Context Protocol

Abstract page for arXiv paper 2603.24943: FinMCP-Bench: Benchmarking LLM Agents for Real-World Financial Tool Use under the Model Context...

arXiv - AI · 3 min ·
[2603.24933] Decoding Market Emotions in Cryptocurrency Tweets via Predictive Statement Classification with Machine Learning and Transformers
Machine Learning

[2603.24933] Decoding Market Emotions in Cryptocurrency Tweets via Predictive Statement Classification with Machine Learning and Transformers

Abstract page for arXiv paper 2603.24933: Decoding Market Emotions in Cryptocurrency Tweets via Predictive Statement Classification with ...

arXiv - AI · 4 min ·
[2603.24929] LogitScope: A Framework for Analyzing LLM Uncertainty Through Information Metrics
Llms

[2603.24929] LogitScope: A Framework for Analyzing LLM Uncertainty Through Information Metrics

Abstract page for arXiv paper 2603.24929: LogitScope: A Framework for Analyzing LLM Uncertainty Through Information Metrics

arXiv - AI · 3 min ·
[2603.24904] On the Foundations of Trustworthy Artificial Intelligence
Machine Learning

[2603.24904] On the Foundations of Trustworthy Artificial Intelligence

Abstract page for arXiv paper 2603.24904: On the Foundations of Trustworthy Artificial Intelligence

arXiv - AI · 3 min ·
[2603.24866] How Far Are Vision-Language Models from Constructing the Real World? A Benchmark for Physical Generative Reasoning
Llms

[2603.24866] How Far Are Vision-Language Models from Constructing the Real World? A Benchmark for Physical Generative Reasoning

Abstract page for arXiv paper 2603.24866: How Far Are Vision-Language Models from Constructing the Real World? A Benchmark for Physical G...

arXiv - AI · 4 min ·
[2603.24853] Resisting Humanization: Ethical Front-End Design Choices in AI for Sensitive Contexts
Machine Learning

[2603.24853] Resisting Humanization: Ethical Front-End Design Choices in AI for Sensitive Contexts

Abstract page for arXiv paper 2603.24853: Resisting Humanization: Ethical Front-End Design Choices in AI for Sensitive Contexts

arXiv - AI · 4 min ·
[2603.24787] ReLope: KL-Regularized LoRA Probes for Multimodal LLM Routing
Llms

[2603.24787] ReLope: KL-Regularized LoRA Probes for Multimodal LLM Routing

Abstract page for arXiv paper 2603.24787: ReLope: KL-Regularized LoRA Probes for Multimodal LLM Routing

arXiv - AI · 4 min ·
[2603.24768] Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regulation Agentic AI Loop for Engineering Design
Llms

[2603.24768] Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regulation Agentic AI Loop for Engineering Design

Abstract page for arXiv paper 2603.24768: Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regulation Agentic AI Loop for Engineeri...

arXiv - AI · 4 min ·
[2603.24747] Formal Semantics for Agentic Tool Protocols: A Process Calculus Approach
Llms

[2603.24747] Formal Semantics for Agentic Tool Protocols: A Process Calculus Approach

Abstract page for arXiv paper 2603.24747: Formal Semantics for Agentic Tool Protocols: A Process Calculus Approach

arXiv - AI · 3 min ·
[2603.24742] Trust as Monitoring: Evolutionary Dynamics of User Trust and AI Developer Behaviour
Machine Learning

[2603.24742] Trust as Monitoring: Evolutionary Dynamics of User Trust and AI Developer Behaviour

Abstract page for arXiv paper 2603.24742: Trust as Monitoring: Evolutionary Dynamics of User Trust and AI Developer Behaviour

arXiv - Machine Learning · 4 min ·
[2603.24676] When Is Collective Intelligence a Lottery? Multi-Agent Scaling Laws for Memetic Drift in LLMs
Llms

[2603.24676] When Is Collective Intelligence a Lottery? Multi-Agent Scaling Laws for Memetic Drift in LLMs

Abstract page for arXiv paper 2603.24676: When Is Collective Intelligence a Lottery? Multi-Agent Scaling Laws for Memetic Drift in LLMs

arXiv - AI · 4 min ·
[2603.24621] ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence
Machine Learning

[2603.24621] ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence

Abstract page for arXiv paper 2603.24621: ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence

arXiv - AI · 3 min ·
Machine Learning

CodexLib — compressed knowledge packs any AI can ingest instantly (100+ packs, 50 domains, REST API)

I built CodexLib (https://codexlib.io) — a curated repository of 100+ deep knowledge bases in compressed, AI-optimized format. The idea: ...

Reddit - Artificial Intelligence · 1 min ·
Llms

Claude's system prompt + XML tags is the most underused power combo right now

Most people just type into ChatGPT like it's Google. Claude with a structured system prompt using XML tags behaves like a completely diff...

Reddit - Artificial Intelligence · 1 min ·
Llms

Pretrained ADAM v2 weights [D]

Hi everyone, I'm a master's student working on anatomy-aware unsupervised anomaly detection in chest X-rays. My thesis uses ADAM v2 (Auto...

Reddit - Machine Learning · 1 min ·
Machine Learning

[for hire] Open for contracts – Veteran Data Scientist (AI / ML / OR) focused on delivering real‑world solutions.

Expert Data Science Consultant | 20-Year Track Record As a seasoned data scientist and fractional leader, I excel at tackling complex pro...

Reddit - ML Jobs · 1 min ·
Llms

[D] - 1M tokens/second serving Qwen 3.5 27B on B200 GPUs, benchmark results and findings

Wrote up the process of pushing Qwen 3.5 27B (dense, FP8) to 1.1M total tok/s on 96 B200 GPUs with vLLM v0.18.0. DP=8 nearly 4x'd through...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] OOD and Spandrels, or What you should know about EBM.

Energy-based model This article will compare EBMs to multi-layered perceptrons, and addresses a lingering question : Whether or not EBMs ...

Reddit - Machine Learning · 1 min ·
Llms

Reducing AI agent token consumption by 90% by fixing the retrieval layer

Quick insight from building retrieval infrastructure for AI agents: Most agents stuff 50,000 tokens of context into every prompt. They re...

Reddit - Artificial Intelligence · 1 min ·
Previous Page 29 Next

Related Topics

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime