[2603.19609] LoD-Loc v3: Generalized Aerial Localization in Dense Cities using Instance Silhouette Alignment
About this article
Abstract page for arXiv paper 2603.19609: LoD-Loc v3: Generalized Aerial Localization in Dense Cities using Instance Silhouette Alignment
Computer Science > Computer Vision and Pattern Recognition arXiv:2603.19609 (cs) [Submitted on 20 Mar 2026] Title:LoD-Loc v3: Generalized Aerial Localization in Dense Cities using Instance Silhouette Alignment Authors:Shuaibang Peng, Juelin Zhu, Xia Li, Kun Yang, Maojun Zhang, Yu Liu, Shen Yan View a PDF of the paper titled LoD-Loc v3: Generalized Aerial Localization in Dense Cities using Instance Silhouette Alignment, by Shuaibang Peng and 6 other authors View PDF HTML (experimental) Abstract:We present LoD-Loc v3, a novel method for generalized aerial visual localization in dense urban environments. While prior work LoD-Loc v2 achieves localization through semantic building silhouette alignment with low-detail city models, it suffers from two key limitations: poor cross-scene generalization and frequent failure in dense building scenes. Our method addresses these challenges through two key innovations. First, we develop a new synthetic data generation pipeline that produces InsLoD-Loc - the largest instance segmentation dataset for aerial imagery to date, comprising 100k images with precise instance building annotations. This enables trained models to exhibit remarkable zero-shot generalization capability. Second, we reformulate the localization paradigm by shifting from semantic to instance silhouette alignment, which significantly reduces pose estimation ambiguity in dense scenes. Extensive experiments demonstrate that LoD-Loc v3 outperforms existing state-of-the-art (...