[2603.28986] Mimosa Framework: Toward Evolving Multi-Agent Systems for Scientific Research

[2603.28986] Mimosa Framework: Toward Evolving Multi-Agent Systems for Scientific Research

arXiv - AI 4 min read

About this article

Abstract page for arXiv paper 2603.28986: Mimosa Framework: Toward Evolving Multi-Agent Systems for Scientific Research

Computer Science > Artificial Intelligence arXiv:2603.28986 (cs) [Submitted on 30 Mar 2026] Title:Mimosa Framework: Toward Evolving Multi-Agent Systems for Scientific Research Authors:Martin Legrand, Tao Jiang, Matthieu Feraud, Benjamin Navet, Yousouf Taghzouti, Fabien Gandon, Elise Dumont, Louis-Félix Nothias View a PDF of the paper titled Mimosa Framework: Toward Evolving Multi-Agent Systems for Scientific Research, by Martin Legrand and 7 other authors View PDF HTML (experimental) Abstract:Current Autonomous Scientific Research (ASR) systems, despite leveraging large language models (LLMs) and agentic architectures, remain constrained by fixed workflows and toolsets that prevent adaptation to evolving tasks and environments. We introduce Mimosa, an evolving multi-agent framework that automatically synthesizes task-specific multi-agent workflows and iteratively refines them through experimental feedback. Mimosa leverages the Model Context Protocol (MCP) for dynamic tool discovery, generates workflow topologies via a meta-orchestrator, executes subtasks through code-generating agents that invoke available tools and scientific software libraries, and scores executions with an LLM-based judge whose feedback drives workflow refinement. On ScienceAgentBench, Mimosa achieves a success rate of 43.1% with DeepSeek-V3.2, surpassing both single-agent baselines and static multi-agent configurations. Our results further reveal that models respond heterogeneously to multi-agent decom...

Originally published on April 01, 2026. Curated by AI News.

Related Articles

Llms

Deterministic vs. probabilistic guardrails for agentic AI — our approach and an open-source tool [D]

We've been thinking hard about whether safety guardrails for AI agents should be LLM-based (probabilistic) or rule-based (deterministic)....

Reddit - Machine Learning · 1 min ·
The 12-month window | TechCrunch
Llms

The 12-month window | TechCrunch

A lot of AI startups exist partly because the foundation models haven't expanded into their category yet. As many jokingly acknowledge, t...

TechCrunch - AI · 3 min ·
Llms

How LLMs decide which pages to cite — and how to optimize for it

When ChatGPT or Perplexity answers a question, it runs RAG: retrieves top candidates from a crawled index, then scores them. The scoring ...

Reddit - Artificial Intelligence · 1 min ·
Llms

Why is every AI getting restricted these days?

Like seriously, it’s not just ChatGPT... it’s Claude, Grok, Gemini… all of them feel way more locked down than before. I genuinely don’t ...

Reddit - Artificial Intelligence · 1 min ·
More in Llms: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime