Machine Learning

ML algorithms, training, and inference

This Week's Best | Monthly Best | Guide | Trending

Top This Week

Ai Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min · about 1 hour ago

Machine Learning

Improving AI models’ ability to explain their predictions

AI News - General · 9 min · about 1 hour ago

Machine Learning

New technique makes AI models leaner and faster while they’re still learning

AI News - General · 9 min · about 1 hour ago

All Content

Llms

[2603.29199] AEC-Bench: A Multimodal Benchmark for Agentic Systems in Architecture, Engineering, and Construction

Abstract page for arXiv paper 2603.29199: AEC-Bench: A Multimodal Benchmark for Agentic Systems in Architecture, Engineering, and Constru...

arXiv - AI · 3 min · 18 days ago

Llms

[2603.29149] Knowledge database development by large language models for countermeasures against viruses and marine toxins

Abstract page for arXiv paper 2603.29149: Knowledge database development by large language models for countermeasures against viruses and...

arXiv - AI · 4 min · 18 days ago

Llms

[2603.29161] Webscraper: Leverage Multimodal Large Language Models for Index-Content Web Scraping

Abstract page for arXiv paper 2603.29161: Webscraper: Leverage Multimodal Large Language Models for Index-Content Web Scraping

arXiv - AI · 3 min · 18 days ago

Llms

[2603.29142] REFINE: Real-world Exploration of Interactive Feedback and Student Behaviour

Abstract page for arXiv paper 2603.29142: REFINE: Real-world Exploration of Interactive Feedback and Student Behaviour

arXiv - AI · 4 min · 18 days ago

Llms

[2603.29139] SciVisAgentBench: A Benchmark for Evaluating Scientific Data Analysis and Visualization Agents

Abstract page for arXiv paper 2603.29139: SciVisAgentBench: A Benchmark for Evaluating Scientific Data Analysis and Visualization Agents

arXiv - AI · 4 min · 18 days ago

Llms

[2603.29112] GISTBench: Evaluating LLM User Understanding via Evidence-Based Interest Verification

Abstract page for arXiv paper 2603.29112: GISTBench: Evaluating LLM User Understanding via Evidence-Based Interest Verification

arXiv - AI · 3 min · 18 days ago

Llms

[2603.29085] PAR$^2$-RAG: Planned Active Retrieval and Reasoning for Multi-Hop Question Answering

Abstract page for arXiv paper 2603.29085: PAR$^2$-RAG: Planned Active Retrieval and Reasoning for Multi-Hop Question Answering

arXiv - AI · 3 min · 18 days ago

Machine Learning

[2603.29075] The Future of AI is Many, Not One

Abstract page for arXiv paper 2603.29075: The Future of AI is Many, Not One

arXiv - AI · 3 min · 18 days ago

Llms

[2603.28990] Drop the Hierarchy and Roles: How Self-Organizing LLM Agents Outperform Designed Structures

Abstract page for arXiv paper 2603.28990: Drop the Hierarchy and Roles: How Self-Organizing LLM Agents Outperform Designed Structures

arXiv - AI · 4 min · 18 days ago

Llms

[2603.28986] Mimosa Framework: Toward Evolving Multi-Agent Systems for Scientific Research

Abstract page for arXiv paper 2603.28986: Mimosa Framework: Toward Evolving Multi-Agent Systems for Scientific Research

arXiv - AI · 4 min · 18 days ago

Machine Learning

[2603.28955] Enhancing Policy Learning with World-Action Model

Abstract page for arXiv paper 2603.28955: Enhancing Policy Learning with World-Action Model

arXiv - AI · 3 min · 18 days ago

Machine Learning

The missing layer between current AI and AGI may be intent architecture

A lot of the AI/ potential AGI conversation still assumes the main path forward is straightforward: increase model capability, expand con...

Reddit - Artificial Intelligence · 1 min · 18 days ago

Llms

Anthropic’s Unreleased Claude Mythos Might Be The Most Advanced AI Model Yet

Anthropic is testing an unreleased artificial intelligence (AI) model with capabilities that exceed any system it has previously released...

AI Tools & Products · 5 min · 18 days ago

Llms

LLM agents can trigger real actions now. But what actually stops them from executing?

We ran into a simple but important issue while building agents with tool calling: the model can propose actions but nothing actually enfo...

Reddit - Artificial Intelligence · 1 min · 18 days ago

Machine Learning

OkCupid gave 3 million dating-app photos to facial recognition firm, FTC says

submitted by /u/Mathemodel [link] [comments]

Reddit - Artificial Intelligence · 1 min · 18 days ago

Llms

Are LLMs a Dead End? (Investors Just Bet $1 Billion on “Yes”)

| AI Reality Check | Cal Newport Chapters 0:00 What is Yan LeCun Up To? 14:55 How is it possible that LeCun could be right about LLM’s be...

Reddit - Artificial Intelligence · 1 min · 18 days ago

Ai Startups

20+ Best AI Project Ideas for 2026: Trending AI Projects

This article presents over 20 AI project ideas tailored for various skill levels, providing a roadmap for building portfolio-ready projec...

AI Events · 18 days ago

Machine Learning

[P] Looking for people who have had training runs fail unexpectedly to beta test a stability monitor. Free, takes 5 minutes to add to your existing loop. DM me.

Anyone actively training models want to try a stability monitor on a real run? Trying to get real world validation outside my own benchma...

Reddit - Machine Learning · 1 min · 18 days ago

Llms

Is the Mirage Effect a bug, or is it Geometric Reconstruction in action? A framework for why VLMs perform better "hallucinating" than guessing, and what that may tell us about what's really inside these models

Last week, a team from Stanford and UCSF (Asadi, O'Sullivan, Fei-Fei Li, Euan Ashley et al.) dropped two companion papers. The first, MAR...

Reddit - Artificial Intelligence · 1 min · 18 days ago

Machine Learning

Yupp shuts down after raising $33M from a16z crypto's Chris Dixon | TechCrunch

Less than a year after launching, with checks from some of the biggest names in Silicon Valley, crowdsourced AI model feedback startup Yu...

TechCrunch - AI · 4 min · 18 days ago

Previous Page 214 Next

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Subscribe to Newsletter

Daily or weekly digest • Unsubscribe anytime

Machine Learning

Top This Week

UMKC Announces New Master of Science in Artificial Intelligence

Improving AI models’ ability to explain their predictions

New technique makes AI models leaner and faster while they’re still learning

All Content

[2603.29199] AEC-Bench: A Multimodal Benchmark for Agentic Systems in Architecture, Engineering, and Construction

[2603.29149] Knowledge database development by large language models for countermeasures against viruses and marine toxins

[2603.29161] Webscraper: Leverage Multimodal Large Language Models for Index-Content Web Scraping

[2603.29142] REFINE: Real-world Exploration of Interactive Feedback and Student Behaviour

[2603.29139] SciVisAgentBench: A Benchmark for Evaluating Scientific Data Analysis and Visualization Agents

[2603.29112] GISTBench: Evaluating LLM User Understanding via Evidence-Based Interest Verification

[2603.29085] PAR$^2$-RAG: Planned Active Retrieval and Reasoning for Multi-Hop Question Answering

[2603.29075] The Future of AI is Many, Not One

[2603.28990] Drop the Hierarchy and Roles: How Self-Organizing LLM Agents Outperform Designed Structures

[2603.28986] Mimosa Framework: Toward Evolving Multi-Agent Systems for Scientific Research

[2603.28955] Enhancing Policy Learning with World-Action Model

The missing layer between current AI and AGI may be intent architecture

Anthropic’s Unreleased Claude Mythos Might Be The Most Advanced AI Model Yet

LLM agents can trigger real actions now. But what actually stops them from executing?

OkCupid gave 3 million dating-app photos to facial recognition firm, FTC says

Are LLMs a Dead End? (Investors Just Bet $1 Billion on “Yes”)

20+ Best AI Project Ideas for 2026: Trending AI Projects

[P] Looking for people who have had training runs fail unexpectedly to beta test a stability monitor. Free, takes 5 minutes to add to your existing loop. DM me.

Is the Mirage Effect a bug, or is it Geometric Reconstruction in action? A framework for why VLMs perform better "hallucinating" than guessing, and what that may tell us about what's really inside these models

Yupp shuts down after raising $33M from a16z crypto's Chris Dixon | TechCrunch

Related Topics

Stay updated with AI News