[2604.01634] CRIT: Graph-Based Automatic Data Synthesis to Enhance Cross-Modal Multi-Hop Reasoning
Computer Science > Machine Learning
arXiv:2604.01634 (cs)
[Submitted on 2 Apr 2026]

Title: CRIT: Graph-Based Automatic Data Synthesis to Enhance Cross-Modal Multi-Hop Reasoning
Authors: Junyoung Sung, Seungwoo Lyu, Minjun Kim, Sumin An, Arsha Nagrani, Paul Hongsuck Seo

Abstract: Real-world reasoning often requires combining information across modalities, connecting textual context with visual cues in a multi-hop process. Yet most multimodal benchmarks fail to capture this ability: they typically rely on single images or sets of images where answers can be inferred from a single modality alone. This limitation is mirrored in the training data, where interleaved image-text content rarely enforces complementary, multi-hop reasoning. As a result, Vision-Language Models (VLMs) frequently hallucinate and produce reasoning traces poorly grounded in visual evidence. To address this gap, we introduce CRIT, a new dataset and benchmark built with a graph-based automatic pipeline for generating complex cross-modal reasoning tasks. CRIT spans diverse domains, from natural images and videos to text-rich sources, and includes a manually verified test set for reliable evaluation. Experiments on this benchmark reveal that even state-of-the-art models struggle on such reasoning tasks. Models trained ...
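To make the idea of graph-based multi-hop synthesis concrete, below is a minimal Python sketch of one plausible design: entities from images and text passages become graph nodes, a random walk that alternates modalities at every hop produces a path, and a template chains the path's relations into a question whose answer is the final node. The node schema, the modality-alternating walk, and the question template are illustrative assumptions, not the authors' actual CRIT pipeline.

```python
# Hypothetical sketch of graph-based cross-modal multi-hop question
# synthesis. Graph schema and templates are assumptions for illustration,
# not the CRIT pipeline described in the paper.
import random
from dataclasses import dataclass, field


@dataclass
class Node:
    name: str
    modality: str                 # "image" or "text"
    source: str                   # e.g. an image file or a passage id
    edges: list = field(default_factory=list)  # (relation, neighbor) pairs


def add_edge(a: Node, relation: str, b: Node) -> None:
    a.edges.append((relation, b))


def sample_multihop_path(start: Node, hops: int, rng: random.Random):
    """Random-walk `hops` edges, switching modality at every hop so that
    answering the resulting question requires both visual and textual
    evidence rather than a single modality alone."""
    path, node = [], start
    for _ in range(hops):
        candidates = [(r, n) for r, n in node.edges if n.modality != node.modality]
        if not candidates:
            return None           # dead end: no cross-modal continuation
        relation, node = rng.choice(candidates)
        path.append((relation, node))
    return path


def compose_question(start: Node, path) -> str:
    """Chain the path's relations into one question whose answer is the
    final node (a naive template; a real pipeline would paraphrase)."""
    phrase = f"the {start.name}"
    for relation, _ in path:
        phrase = f"the {relation} of {phrase}"
    return f"What is {phrase}?"


if __name__ == "__main__":
    rng = random.Random(0)
    # Toy cross-modal graph: a pictured landmark linked to textual facts,
    # which in turn point back to visual evidence.
    tower = Node("clock tower", "image", "img_001.jpg")
    city = Node("city it stands in", "text", "passage_17")
    river = Node("river through that city", "image", "img_042.jpg")
    add_edge(tower, "location", city)
    add_edge(city, "river", river)

    path = sample_multihop_path(tower, hops=2, rng=rng)
    if path:
        print(compose_question(tower, path))  # 2-hop cross-modal question
        print("answer:", path[-1][1].name)    # final node is the answer
```

Forcing a modality switch at every hop is what rules out single-modality shortcuts: no contiguous subsequence of the evidence chain lives entirely in text or entirely in images, so a model must genuinely combine the two to answer.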