[2603.28026] When Choices Become Priors: Contrastive Decoding for Scientific Figure Multiple-Choice QA

[2603.28026] When Choices Become Priors: Contrastive Decoding for Scientific Figure Multiple-Choice QA

arXiv - AI 3 min read

About this article

Abstract page for arXiv paper 2603.28026: When Choices Become Priors: Contrastive Decoding for Scientific Figure Multiple-Choice QA

Computer Science > Artificial Intelligence arXiv:2603.28026 (cs) [Submitted on 30 Mar 2026] Title:When Choices Become Priors: Contrastive Decoding for Scientific Figure Multiple-Choice QA Authors:Taeyun Roh, Eun-yeong Jo, Wonjune Jang, Jaewoo Kang View a PDF of the paper titled When Choices Become Priors: Contrastive Decoding for Scientific Figure Multiple-Choice QA, by Taeyun Roh and 3 other authors View PDF HTML (experimental) Abstract:Scientific figure multiple-choice question answering (MCQA) requires models to reason over diverse visual evidence, ranging from charts and multipanel figures to microscopy and biomedical images. However, this setting suffers from a distinctive bias: answer choices themselves can act as priors, steering multimodal models toward scientifically plausible options even when the figure supports a different answer. We investigate this failure mode through a simple question: what if decoding explicitly discounts what the model would prefer from text alone, so as to favor figure-grounded evidence? To this end, we propose SCICON, a training-free decoding method that scores each candidate by subtracting a text-only option score from its image-conditioned counterpart. Unlike prior contrastive decoding approaches that mitigate hallucinations by contrasting original inputs with distorted images or perturbed instructions, SCICON directly targets the choice-induced prior encoded in candidate text. Across three scientific figure QA benchmarks and three mo...

Originally published on March 31, 2026. Curated by AI News.

Related Articles

Machine Learning

[R] Architecture Determines Optimization: Deriving Weight Updates from Network Topology (seeking arXiv endorsement - cs.LG)

Abstract: We derive neural network weight updates from first principles without assuming gradient descent or a specific loss function. St...

Reddit - Machine Learning · 1 min ·
Machine Learning

[P] ML project (XGBoost + Databricks + MLflow) — how to talk about “production issues” in interviews?

Hey all, I recently built an end-to-end fraud detection project using a large banking dataset: Trained an XGBoost model Used Databricks f...

Reddit - Machine Learning · 1 min ·
Machine Learning

[D] The memory chip market lost tens of billions over a paper this community would have understood in 10 minutes

TurboQuant was teased recently and tens of billions gone from memory chip market in 48 hours but anyone in this community who read the pa...

Reddit - Machine Learning · 1 min ·
Copilot is ‘for entertainment purposes only,’ according to Microsoft’s terms of use | TechCrunch
Machine Learning

Copilot is ‘for entertainment purposes only,’ according to Microsoft’s terms of use | TechCrunch

AI skeptics aren’t the only ones warning users not to unthinkingly trust models’ outputs — that’s what the AI companies say themselves in...

TechCrunch - AI · 3 min ·
More in Machine Learning: This Week Guide Trending

No comments

No comments yet. Be the first to comment!

Stay updated with AI News

Get the latest news, tools, and insights delivered to your inbox.

Daily or weekly digest • Unsubscribe anytime