[2603.26731] Contextual inference from single objects in Vision-Language models
Computer Science > Computer Vision and Pattern Recognition

arXiv:2603.26731 (cs)

[Submitted on 20 Mar 2026]

Title: Contextual inference from single objects in Vision-Language models

Authors: Martina G. Vilas, Timothy Schaumlöffel, Gemma Roig

Abstract: How much scene context a single object carries is a well-studied question in human scene perception, yet how this capacity is organized in vision-language models (VLMs) remains poorly understood, with direct implications for the robustness of these models. We investigate this question through a systematic behavioral and mechanistic analysis of contextual inference from single objects. Presenting VLMs with single objects on masked backgrounds, we probe their ability to infer both the fine-grained scene category and the coarse superordinate context (indoor vs. outdoor). We find that single objects support above-chance inference at both levels, with performance modulated by the same object properties that predict human scene categorization. Object identity, scene, and superordinate predictions are partially dissociable: accurate inference at one level neither requires nor guarantees accurate inference at the others, and the degree of coupling differs markedly across models. Mechanistically, object representations that remain stable when background context is removed are mor...
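The stimulus construction described in the abstract (a single object presented on a masked background) can be sketched as follows. This is a minimal illustrative sketch, not the authors' code: it assumes an RGB image as a NumPy array and a binary object segmentation mask, and replaces everything outside the mask with a uniform gray fill; the function and variable names are hypothetical.

```python
import numpy as np

def mask_background(image: np.ndarray, object_mask: np.ndarray,
                    fill: int = 127) -> np.ndarray:
    """Keep only the pixels inside the object mask.

    image: (H, W, 3) uint8 RGB array.
    object_mask: (H, W) boolean array, True where the object is.
    fill: gray value used to mask out the background.
    """
    out = np.full_like(image, fill)          # uniform gray background
    out[object_mask] = image[object_mask]    # paste the object back in
    return out

# Toy example: a 4x4 "image" with a 2x2 object in the top-left corner.
img = np.arange(48, dtype=np.uint8).reshape(4, 4, 3)
obj = np.zeros((4, 4), dtype=bool)
obj[:2, :2] = True
stim = mask_background(img, obj)  # object pixels kept, rest set to 127
```

The resulting stimulus would then be passed to a VLM (e.g. for zero-shot scene-category prediction) to test how much context the isolated object supports; that probing step depends on the specific model interface and is omitted here.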