[2603.25686] Just Zoom In: Cross-View Geo-Localization via Autoregressive Zooming

arXiv - AI March 27, 2026 4 min read

Computer Science > Computer Vision and Pattern Recognition
arXiv:2603.25686 (cs)
[Submitted on 26 Mar 2026]

Title: Just Zoom In: Cross-View Geo-Localization via Autoregressive Zooming
Authors: Yunus Talha Erzurumlu, Jiyong Kwag, Alper Yilmaz

Abstract: Cross-view geo-localization (CVGL) estimates a camera's location by matching a street-view image to geo-referenced overhead imagery, enabling GPS-denied localization and navigation. Existing methods almost universally formulate CVGL as an image-retrieval problem in a contrastively trained embedding space. This ties performance to large batches and hard negative mining, and it ignores both the geometric structure of maps and the coverage mismatch between street-view and overhead imagery. In particular, salient landmarks visible from the street view can fall outside a fixed satellite crop, making retrieval targets ambiguous and limiting explicit spatial inference over the map.

We propose Just Zoom In, an alternative formulation that performs CVGL via autoregressive zooming over a city-scale overhead map. Starting from a coarse satellite view, the model takes a short sequence of zoom-in decisions to select a terminal satellite cell at a target resolution, without contrastive losses or hard negative mining. We further introduce a realistic benchmark with crowd-...
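The coarse-to-fine procedure the abstract describes can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not state how the map is partitioned, how many zoom steps are taken, or what model scores the candidates. The 2x2 split per step, the `Cell` type, the `zoom_in` function, and the `oracle_scorer` stand-in (which simply knows the true location) are all hypothetical names and choices made for illustration. In a real system the scorer would presumably be a learned model conditioned on the street-view image and on the zoom decisions taken so far.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class Cell:
    """Square region of the overhead map (origin and side length in map units)."""
    x: float
    y: float
    size: float

    def children(self, grid: int) -> List["Cell"]:
        # Split this cell into grid x grid equally sized sub-cells, row by row.
        step = self.size / grid
        return [Cell(self.x + c * step, self.y + r * step, step)
                for r in range(grid) for c in range(grid)]

    def contains(self, px: float, py: float) -> bool:
        return (self.x <= px < self.x + self.size
                and self.y <= py < self.y + self.size)


# A scorer sees the zoom history so far and the candidate sub-cells,
# and returns one score per candidate (assumed interface).
ScoreFn = Callable[[List[Cell], List[Cell]], List[float]]


def zoom_in(root: Cell, score: ScoreFn, steps: int,
            grid: int = 2) -> Tuple[Cell, List[Cell]]:
    """Greedily pick the best-scoring sub-cell at each step.

    Each decision depends on the path chosen so far, which is what makes
    the sequence autoregressive. Returns the terminal cell and the path.
    """
    path = [root]
    for _ in range(steps):
        candidates = path[-1].children(grid)
        scores = score(path, candidates)
        best = max(range(len(candidates)), key=scores.__getitem__)
        path.append(candidates[best])
    return path[-1], path


def oracle_scorer(target: Tuple[float, float]) -> ScoreFn:
    """Placeholder for a learned model: scores 1.0 for the cell holding the target."""
    px, py = target

    def score(path: List[Cell], candidates: List[Cell]) -> List[float]:
        return [1.0 if c.contains(px, py) else 0.0 for c in candidates]

    return score


# A 1024-unit map, three zoom steps, true location at (700, 130).
root = Cell(0.0, 0.0, 1024.0)
terminal, path = zoom_in(root, oracle_scorer((700.0, 130.0)), steps=3)
print(terminal)  # Cell(x=640.0, y=128.0, size=128.0)
```

With a 2x2 split, each step halves the cell's side length, so three steps narrow a 1024-unit map to a 128-unit terminal cell using only 4 candidate scores per step instead of comparing the query against all 64 terminal cells at once.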

Originally published on March 27, 2026. Curated by AI News.
