[2510.01448] GeoSURGE: Geo-localization using Semantic Fusion with

[2510.01448] GeoSURGE: Geo-localization using Semantic Fusion with Hierarchy of Geographic Embeddings

arXiv - AI March 30, 2026 3 min read

About this article

Abstract page for arXiv paper 2510.01448: GeoSURGE: Geo-localization using Semantic Fusion with Hierarchy of Geographic Embeddings

Computer Science > Computer Vision and Pattern Recognition arXiv:2510.01448 (cs) [Submitted on 1 Oct 2025 (v1), last revised 27 Mar 2026 (this version, v2)] Title:GeoSURGE: Geo-localization using Semantic Fusion with Hierarchy of Geographic Embeddings Authors:Angel Daruna, Nicholas Meegan, Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar View a PDF of the paper titled GeoSURGE: Geo-localization using Semantic Fusion with Hierarchy of Geographic Embeddings, by Angel Daruna and 4 other authors View PDF HTML (experimental) Abstract:Worldwide visual geo-localization aims to determine the geographic location of an image anywhere on Earth using only its visual content. Despite recent progress, learning expressive representations of geographic space remains challenging due to the inherently low-dimensional nature of geographic coordinates. We formulate global geo-localization as aligning the visual representation of a query image with a learned geographic representation. Our approach explicitly models the world as a hierarchy of learned geographic embeddings, enabling a distributed and multi-scale representation of geographic space. In addition, we introduce a semantic fusion module that efficiently integrates appearance features with semantic segmentation through latent cross-attention, producing a more robust visual representation for localization. Experiments on five widely used geo-localization benchmarks demonstrate that our method achieves new state-of-the-art results on 22 ...

Originally published on March 30, 2026. Curated by AI News.

Llms

Is the Mirage Effect a bug, or is it Geometric Reconstruction in action? A framework for why VLMs perform better "hallucinating" than guessing, and what that may tell us about what's really inside these models

Last week, a team from Stanford and UCSF (Asadi, O'Sullivan, Fei-Fei Li, Euan Ashley et al.) dropped two companion papers. The first, MAR...

Reddit - Artificial Intelligence · 1 min · 24 minutes ago

Nlp

The Galaxy S26’s photo app can sloppify your memories | The Verge

Samsung’s S26 series offers some new AI photo editing capabilities to transform your photos. But where’s the line between acceptable edit...

The Verge - AI · 8 min · about 5 hours ago

Llms

[D] The problem with comparing AI memory system benchmarks — different evaluation methods make scores meaningless

I've been reviewing how various AI memory systems evaluate their performance and noticed a fundamental issue with cross-system comparison...

Reddit - Machine Learning · 1 min · about 10 hours ago

Machine Learning

[D] I had an idea, would love your thoughts

What happens that while training an AI during pre training we make it such that if makes "misaligned behaviour" then we just reduce like ...

Reddit - Machine Learning · 1 min · about 11 hours ago

[2510.01448] GeoSURGE: Geo-localization using Semantic Fusion with Hierarchy of Geographic Embeddings

About this article

Related Articles

Is the Mirage Effect a bug, or is it Geometric Reconstruction in action? A framework for why VLMs perform better "hallucinating" than guessing, and what that may tell us about what's really inside these models

The Galaxy S26’s photo app can sloppify your memories | The Verge

[D] The problem with comparing AI memory system benchmarks — different evaluation methods make scores meaningless

[D] I had an idea, would love your thoughts

No comments

Stay updated with AI News