Machine Learning

ML algorithms, training, and inference

Top This Week

AI Infrastructure

UMKC Announces New Master of Science in Artificial Intelligence

UMKC announces a new Master of Science in Artificial Intelligence program aimed at addressing workforce demand for AI expertise, set to l...

AI News - General · 4 min ·
Machine Learning

Improving AI models’ ability to explain their predictions

AI News - General · 9 min ·
Machine Learning

New technique makes AI models leaner and faster while they’re still learning

AI News - General · 9 min ·

All Content

LLMs

[2603.29199] AEC-Bench: A Multimodal Benchmark for Agentic Systems in Architecture, Engineering, and Construction

Abstract page for arXiv paper 2603.29199: AEC-Bench: A Multimodal Benchmark for Agentic Systems in Architecture, Engineering, and Constru...

arXiv - AI · 3 min ·
LLMs

[2603.29149] Knowledge database development by large language models for countermeasures against viruses and marine toxins

Abstract page for arXiv paper 2603.29149: Knowledge database development by large language models for countermeasures against viruses and...

arXiv - AI · 4 min ·
LLMs

[2603.29161] Webscraper: Leverage Multimodal Large Language Models for Index-Content Web Scraping

Abstract page for arXiv paper 2603.29161: Webscraper: Leverage Multimodal Large Language Models for Index-Content Web Scraping

arXiv - AI · 3 min ·
LLMs

[2603.29142] REFINE: Real-world Exploration of Interactive Feedback and Student Behaviour

Abstract page for arXiv paper 2603.29142: REFINE: Real-world Exploration of Interactive Feedback and Student Behaviour

arXiv - AI · 4 min ·
LLMs

[2603.29139] SciVisAgentBench: A Benchmark for Evaluating Scientific Data Analysis and Visualization Agents

Abstract page for arXiv paper 2603.29139: SciVisAgentBench: A Benchmark for Evaluating Scientific Data Analysis and Visualization Agents

arXiv - AI · 4 min ·
LLMs

[2603.29112] GISTBench: Evaluating LLM User Understanding via Evidence-Based Interest Verification

Abstract page for arXiv paper 2603.29112: GISTBench: Evaluating LLM User Understanding via Evidence-Based Interest Verification

arXiv - AI · 3 min ·
LLMs

[2603.29085] PAR$^2$-RAG: Planned Active Retrieval and Reasoning for Multi-Hop Question Answering

Abstract page for arXiv paper 2603.29085: PAR$^2$-RAG: Planned Active Retrieval and Reasoning for Multi-Hop Question Answering

arXiv - AI · 3 min ·
Machine Learning

[2603.29075] The Future of AI is Many, Not One

Abstract page for arXiv paper 2603.29075: The Future of AI is Many, Not One

arXiv - AI · 3 min ·
LLMs

[2603.28990] Drop the Hierarchy and Roles: How Self-Organizing LLM Agents Outperform Designed Structures

Abstract page for arXiv paper 2603.28990: Drop the Hierarchy and Roles: How Self-Organizing LLM Agents Outperform Designed Structures

arXiv - AI · 4 min ·
LLMs

[2603.28986] Mimosa Framework: Toward Evolving Multi-Agent Systems for Scientific Research

Abstract page for arXiv paper 2603.28986: Mimosa Framework: Toward Evolving Multi-Agent Systems for Scientific Research

arXiv - AI · 4 min ·
Machine Learning

[2603.28955] Enhancing Policy Learning with World-Action Model

Abstract page for arXiv paper 2603.28955: Enhancing Policy Learning with World-Action Model

arXiv - AI · 3 min ·
Machine Learning

The missing layer between current AI and AGI may be intent architecture

A lot of the AI/potential AGI conversation still assumes the main path forward is straightforward: increase model capability, expand con...

Reddit - Artificial Intelligence · 1 min ·
LLMs

Anthropic’s Unreleased Claude Mythos Might Be The Most Advanced AI Model Yet

Anthropic is testing an unreleased artificial intelligence (AI) model with capabilities that exceed any system it has previously released...

AI Tools & Products · 5 min ·
LLMs

LLM agents can trigger real actions now. But what actually stops them from executing?

We ran into a simple but important issue while building agents with tool calling: the model can propose actions but nothing actually enfo...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

OkCupid gave 3 million dating-app photos to facial recognition firm, FTC says

Reddit - Artificial Intelligence · 1 min ·
LLMs

Are LLMs a Dead End? (Investors Just Bet $1 Billion on “Yes”)

AI Reality Check | Cal Newport. Chapters: 0:00 What is Yann LeCun Up To? 14:55 How is it possible that LeCun could be right about LLMs be...

Reddit - Artificial Intelligence · 1 min ·
AI Startups

20+ Best AI Project Ideas for 2026: Trending AI Projects

This article presents over 20 AI project ideas tailored for various skill levels, providing a roadmap for building portfolio-ready projec...

AI Events ·
Machine Learning

[P] Looking for people who have had training runs fail unexpectedly to beta test a stability monitor. Free, takes 5 minutes to add to your existing loop. DM me.

Anyone actively training models want to try a stability monitor on a real run? Trying to get real world validation outside my own benchma...

Reddit - Machine Learning · 1 min ·
LLMs

Is the Mirage Effect a bug, or is it Geometric Reconstruction in action? A framework for why VLMs perform better "hallucinating" than guessing, and what that may tell us about what's really inside these models

Last week, a team from Stanford and UCSF (Asadi, O'Sullivan, Fei-Fei Li, Euan Ashley et al.) dropped two companion papers. The first, MAR...

Reddit - Artificial Intelligence · 1 min ·
Machine Learning

Yupp shuts down after raising $33M from a16z crypto's Chris Dixon | TechCrunch

Less than a year after launching, with checks from some of the biggest names in Silicon Valley, crowdsourced AI model feedback startup Yu...

TechCrunch - AI · 4 min ·