Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Machine Learning

Trained a Qwen2.5-0.5B-Instruct bf16 model on Reddit post summarization task with GRPO [P]

So, a few days back I shared a post where I trained a tiny Qwen2.5-0.5B-Instruct model on smoltldr (reddit post summarization dataset of ...

Reddit - Machine Learning · 1 min · about 2 hours ago

Machine Learning

Mark Zuckerberg is reportedly building an AI clone to replace him in meetings | The Verge

Meta is working to build an AI version of its CEO Mark Zuckerberg, which he will use to interact with employees, according to a report fr...

The Verge - AI · 4 min · about 2 hours ago

Machine Learning

When the Mirror Turns: How AI alignment reshapes the voice inside your head

We build our inner voices from the voices we're in dialogue with. Vygotsky established this nearly a century ago. For people in sustained...

Reddit - Artificial Intelligence · 1 min · about 3 hours ago

All Content

Llms

[2603.24961] Can MLLMs Read Students' Minds? Unpacking Multimodal Error Analysis in Handwritten Math

Abstract page for arXiv paper 2603.24961: Can MLLMs Read Students' Minds? Unpacking Multimodal Error Analysis in Handwritten Math

arXiv - AI · 4 min · 17 days ago

Llms

[2603.24943] FinMCP-Bench: Benchmarking LLM Agents for Real-World Financial Tool Use under the Model Context Protocol

Abstract page for arXiv paper 2603.24943: FinMCP-Bench: Benchmarking LLM Agents for Real-World Financial Tool Use under the Model Context...

arXiv - AI · 3 min · 17 days ago

Machine Learning

[2603.24933] Decoding Market Emotions in Cryptocurrency Tweets via Predictive Statement Classification with Machine Learning and Transformers

Abstract page for arXiv paper 2603.24933: Decoding Market Emotions in Cryptocurrency Tweets via Predictive Statement Classification with ...

arXiv - AI · 4 min · 17 days ago

Llms

[2603.24929] LogitScope: A Framework for Analyzing LLM Uncertainty Through Information Metrics

Abstract page for arXiv paper 2603.24929: LogitScope: A Framework for Analyzing LLM Uncertainty Through Information Metrics

arXiv - AI · 3 min · 17 days ago

Machine Learning

[2603.24904] On the Foundations of Trustworthy Artificial Intelligence

Abstract page for arXiv paper 2603.24904: On the Foundations of Trustworthy Artificial Intelligence

arXiv - AI · 3 min · 17 days ago

Llms

[2603.24866] How Far Are Vision-Language Models from Constructing the Real World? A Benchmark for Physical Generative Reasoning

Abstract page for arXiv paper 2603.24866: How Far Are Vision-Language Models from Constructing the Real World? A Benchmark for Physical G...

arXiv - AI · 4 min · 17 days ago

Machine Learning

[2603.24853] Resisting Humanization: Ethical Front-End Design Choices in AI for Sensitive Contexts

Abstract page for arXiv paper 2603.24853: Resisting Humanization: Ethical Front-End Design Choices in AI for Sensitive Contexts

arXiv - AI · 4 min · 17 days ago

Llms

[2603.24787] ReLope: KL-Regularized LoRA Probes for Multimodal LLM Routing

Abstract page for arXiv paper 2603.24787: ReLope: KL-Regularized LoRA Probes for Multimodal LLM Routing

arXiv - AI · 4 min · 17 days ago

Llms

[2603.24768] Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regulation Agentic AI Loop for Engineering Design

Abstract page for arXiv paper 2603.24768: Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regulation Agentic AI Loop for Engineeri...

arXiv - AI · 4 min · 17 days ago

Llms

[2603.24747] Formal Semantics for Agentic Tool Protocols: A Process Calculus Approach

Abstract page for arXiv paper 2603.24747: Formal Semantics for Agentic Tool Protocols: A Process Calculus Approach

arXiv - AI · 3 min · 17 days ago

Machine Learning

[2603.24742] Trust as Monitoring: Evolutionary Dynamics of User Trust and AI Developer Behaviour

Abstract page for arXiv paper 2603.24742: Trust as Monitoring: Evolutionary Dynamics of User Trust and AI Developer Behaviour

arXiv - Machine Learning · 4 min · 17 days ago

Llms

[2603.24676] When Is Collective Intelligence a Lottery? Multi-Agent Scaling Laws for Memetic Drift in LLMs

Abstract page for arXiv paper 2603.24676: When Is Collective Intelligence a Lottery? Multi-Agent Scaling Laws for Memetic Drift in LLMs

arXiv - AI · 4 min · 17 days ago

Machine Learning

[2603.24621] ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence

Abstract page for arXiv paper 2603.24621: ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence

arXiv - AI · 3 min · 17 days ago

Machine Learning

CodexLib — compressed knowledge packs any AI can ingest instantly (100+ packs, 50 domains, REST API)

I built CodexLib (https://codexlib.io) — a curated repository of 100+ deep knowledge bases in compressed, AI-optimized format. The idea: ...

Reddit - Artificial Intelligence · 1 min · 18 days ago

Llms

Claude's system prompt + XML tags is the most underused power combo right now

Most people just type into ChatGPT like it's Google. Claude with a structured system prompt using XML tags behaves like a completely diff...

Reddit - Artificial Intelligence · 1 min · 18 days ago

Llms

Pretrained ADAM v2 weights [D]

Hi everyone, I'm a master's student working on anatomy-aware unsupervised anomaly detection in chest X-rays. My thesis uses ADAM v2 (Auto...

Reddit - Machine Learning · 1 min · 18 days ago

Machine Learning

[for hire] Open for contracts – Veteran Data Scientist (AI / ML / OR) focused on delivering real‑world solutions.

Expert Data Science Consultant | 20-Year Track Record As a seasoned data scientist and fractional leader, I excel at tackling complex pro...

Reddit - ML Jobs · 1 min · 18 days ago

Llms

[D] - 1M tokens/second serving Qwen 3.5 27B on B200 GPUs, benchmark results and findings

Wrote up the process of pushing Qwen 3.5 27B (dense, FP8) to 1.1M total tok/s on 96 B200 GPUs with vLLM v0.18.0. DP=8 nearly 4x'd through...

Reddit - Machine Learning · 1 min · 18 days ago

Machine Learning

[D] OOD and Spandrels, or What you should know about EBM.

Energy-based model This article will compare EBMs to multi-layered perceptrons, and addresses a lingering question : Whether or not EBMs ...

Reddit - Machine Learning · 1 min · 18 days ago

Llms

Reducing AI agent token consumption by 90% by fixing the retrieval layer

Quick insight from building retrieval infrastructure for AI agents: Most agents stuff 50,000 tokens of context into every prompt. They re...

Reddit - Artificial Intelligence · 1 min · 18 days ago

Previous Page 188 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

Trained a Qwen2.5-0.5B-Instruct bf16 model on Reddit post summarization task with GRPO [P]

Mark Zuckerberg is reportedly building an AI clone to replace him in meetings | The Verge

When the Mirror Turns: How AI alignment reshapes the voice inside your head

All Content

[2603.24961] Can MLLMs Read Students' Minds? Unpacking Multimodal Error Analysis in Handwritten Math

[2603.24943] FinMCP-Bench: Benchmarking LLM Agents for Real-World Financial Tool Use under the Model Context Protocol

[2603.24933] Decoding Market Emotions in Cryptocurrency Tweets via Predictive Statement Classification with Machine Learning and Transformers

[2603.24929] LogitScope: A Framework for Analyzing LLM Uncertainty Through Information Metrics

[2603.24904] On the Foundations of Trustworthy Artificial Intelligence

[2603.24866] How Far Are Vision-Language Models from Constructing the Real World? A Benchmark for Physical Generative Reasoning

[2603.24853] Resisting Humanization: Ethical Front-End Design Choices in AI for Sensitive Contexts

[2603.24787] ReLope: KL-Regularized LoRA Probes for Multimodal LLM Routing

[2603.24768] Supervising Ralph Wiggum: Exploring a Metacognitive Co-Regulation Agentic AI Loop for Engineering Design

[2603.24747] Formal Semantics for Agentic Tool Protocols: A Process Calculus Approach

[2603.24742] Trust as Monitoring: Evolutionary Dynamics of User Trust and AI Developer Behaviour

[2603.24676] When Is Collective Intelligence a Lottery? Multi-Agent Scaling Laws for Memetic Drift in LLMs

[2603.24621] ARC-AGI-3: A New Challenge for Frontier Agentic Intelligence

CodexLib — compressed knowledge packs any AI can ingest instantly (100+ packs, 50 domains, REST API)

Claude's system prompt + XML tags is the most underused power combo right now

Pretrained ADAM v2 weights [D]

[for hire] Open for contracts – Veteran Data Scientist (AI / ML / OR) focused on delivering real‑world solutions.

[D] - 1M tokens/second serving Qwen 3.5 27B on B200 GPUs, benchmark results and findings

[D] OOD and Spandrels, or What you should know about EBM.

Reducing AI agent token consumption by 90% by fixing the retrieval layer

Related Topics

Stay updated with AI News