[2602.07943] IV Co-Scientist: Multi-Agent LLM Framework for Causal Instrumental Variable Discovery

[2602.07943] IV Co-Scientist: Multi-Agent LLM Framework for Causal Instrumental Variable Discovery

arXiv - AI 3 min read

About this article

Abstract page for arXiv paper 2602.07943: IV Co-Scientist: Multi-Agent LLM Framework for Causal Instrumental Variable Discovery

Computer Science > Artificial Intelligence arXiv:2602.07943 (cs) [Submitted on 8 Feb 2026 (v1), last revised 6 Apr 2026 (this version, v2)] Title:IV Co-Scientist: Multi-Agent LLM Framework for Causal Instrumental Variable Discovery Authors:Ivaxi Sheth, Zhijing Jin, Bryan Wilder, Dominik Janzing, Mario Fritz View a PDF of the paper titled IV Co-Scientist: Multi-Agent LLM Framework for Causal Instrumental Variable Discovery, by Ivaxi Sheth and 4 other authors View PDF HTML (experimental) Abstract:In the presence of confounding between an endogenous variable and the outcome, instrumental variables (IVs) are used to isolate the causal effect of the endogenous variable. Identifying valid instruments requires interdisciplinary knowledge, creativity, and contextual understanding, making it a non-trivial task. In this paper, we investigate whether large language models (LLMs) can aid in this task. We perform a two-stage evaluation framework. First, we test whether LLMs can recover well-established instruments from the literature, assessing their ability to replicate standard reasoning. Second, we evaluate whether LLMs can identify and avoid instruments that have been empirically or theoretically discredited. Building on these results, we introduce IV Co-Scientist, a multi-agent system that proposes, critiques, and refines IVs for a given treatment-outcome pair. We also introduce a statistical test to contextualize consistency in the absence of ground truth. Our results show the po...

Originally published on April 07, 2026. Curated by AI News.

Related Articles

ChatGPT downloads are slowing — and may cause problems for OpenAI’s IPO | The Verge
Llms

ChatGPT downloads are slowing — and may cause problems for OpenAI’s IPO | The Verge

Data from Sensor Tower shows ChatGPT’s growth is slowing down, as Claude and other competitors’ growth is increasing, just as OpenAI is p...

The Verge - AI · 4 min ·
Larry Ellison’s betting everything on OpenAI. Will it pay off or pop the bubble? | The Verge
Llms

Larry Ellison’s betting everything on OpenAI. Will it pay off or pop the bubble? | The Verge

Larry Ellison and Oracle have staked their future on a data center deal with OpenAI and a big bet that enterprise AI will pay off.

The Verge - AI · 32 min ·
Llms

Google just released Deep Research Max — an autonomous research agent that writes expert-grade reports on its own

Google quietly dropped something interesting last week. They updated their Deep Research agent (available via Gemini API) and introduced ...

Reddit - Artificial Intelligence · 1 min ·
When Robots Have Their ChatGPT Moment, Remember These Pincers | WIRED
Llms

When Robots Have Their ChatGPT Moment, Remember These Pincers | WIRED

From sorting chicken nuggets to screwing in light bulbs, Eka’s robots are eerily lifelike. But do they have real physical smarts?

Wired - AI · 13 min ·
More in Llms: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime