[2603.27251] Zero-shot Vision-Language Reranking for Cross-View Geolocalization
Computer Science > Computer Vision and Pattern Recognition
arXiv:2603.27251 (cs)
[Submitted on 28 Mar 2026]

Title: Zero-shot Vision-Language Reranking for Cross-View Geolocalization
Authors: Yunus Talha Erzurumlu, John E. Anderson, William J. Shuart, Charles Toth, Alper Yilmaz

Abstract: Cross-view geolocalization (CVGL) systems, while effective at retrieving a list of relevant candidates (high Recall@k), often fail to identify the single best match (low Top-1 accuracy). This work investigates zero-shot Vision-Language Models (VLMs) as rerankers to close this gap. We propose a two-stage framework: state-of-the-art (SOTA) retrieval followed by VLM reranking. We systematically compare two strategies: (1) pointwise (scoring candidates individually) and (2) pairwise (comparing candidates relatively). Experiments on the VIGOR dataset show a clear divergence: every pointwise method either leaves performance unchanged or degrades it catastrophically, whereas a pairwise comparison strategy using LLaVA improves Top-1 accuracy over the strong retrieval baseline. Our analysis concludes that these VLMs are poorly calibrated for absolute relevance scoring but are effective at fine-grained relative visual judgment, making pairwise reranking a promising direction for enhancing CVGL precision.
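The abstract contrasts pointwise scoring with pairwise comparison for reranking a retrieved shortlist. A minimal sketch of the pairwise idea, under the assumption that the VLM is exposed as a binary preference function `prefers(query, a, b)` (a hypothetical interface, not the paper's actual API), is a single bubble-up pass that promotes the pairwise winner to the top-1 slot while leaving the rest of the ranking largely intact:

```python
from typing import Any, Callable, List


def pairwise_rerank(
    query: Any,
    candidates: List[Any],
    prefers: Callable[[Any, Any, Any], bool],
) -> List[Any]:
    """Promote the pairwise winner of the candidate list to rank 1.

    `prefers(query, a, b)` should return True when the judge (here, a
    stand-in for a VLM such as LLaVA) considers candidate `a` a better
    match for `query` than candidate `b`. This is a hypothetical
    interface for illustration only.
    """
    if not candidates:
        return candidates
    best = candidates[0]
    rest: List[Any] = []
    for cand in candidates[1:]:
        if prefers(query, cand, best):
            # The challenger wins the pairwise comparison; the previous
            # leader drops back into the tail of the ranking.
            rest.append(best)
            best = cand
        else:
            rest.append(cand)
    return [best] + rest


if __name__ == "__main__":
    # Toy comparator standing in for the VLM: prefer the candidate
    # whose numeric "feature" is closer to the query's.
    prefers = lambda q, a, b: abs(a - q) < abs(b - q)
    print(pairwise_rerank(10, [7, 9, 10, 3]))  # -> [10, 7, 9, 3]
```

This single-pass tournament needs only k-1 VLM calls for a top-k shortlist, which is one plausible reason pairwise reranking is attractive when each comparison is an expensive model invocation; the paper's exact comparison schedule may differ.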