MeshVPR: Citywide Visual Place Recognition Using 3D Meshes
Gabriele Berton, Lorenz Junglas, Riccardo Zaccone, Thomas Pollok, Barbara Caputo, Carlo Masone
TL;DR
MeshVPR addresses citywide visual place recognition using 3D textured meshes by closing the domain gap between real query images and synthetic mesh-derived databases. It introduces a lightweight feature alignment framework that fine-tunes a synthetic-model to align with a real-model embeddings, enabling effective retrieval from synthetic databases with pretrained VPR backbones. The authors provide three new city-scale test sets with freely available meshes and demonstrate that MeshVPR delivers competitive performance while enabling scalable deployment, data reuse, and privacy advantages; they also analyze mesh quality, Syn2Real gap bridging, and training data requirements. The work points to practical implications for scalable mesh-based localization and outlines future directions such as full mesh-based VL pipelines, multi-domain synthetic imagery, and drone-like viewpoints for localization at city scale.
Abstract
Mesh-based scene representation offers a promising direction for simplifying large-scale hierarchical visual localization pipelines, combining a visual place recognition step based on global features (retrieval) and a visual localization step based on local features. While existing work demonstrates the viability of meshes for visual localization, the impact of using synthetic databases rendered from them in visual place recognition remains largely unexplored. In this work we investigate using dense 3D textured meshes for large-scale Visual Place Recognition (VPR). We identify a significant performance drop when using synthetic mesh-based image databases compared to real-world images for retrieval. To address this, we propose MeshVPR, a novel VPR pipeline that utilizes a lightweight features alignment framework to bridge the gap between real-world and synthetic domains. MeshVPR leverages pre-trained VPR models and is efficient and scalable for city-wide deployments. We introduce novel datasets with freely available 3D meshes and manually collected queries from Berlin, Paris, and Melbourne. Extensive evaluations demonstrate that MeshVPR achieves competitive performance with standard VPR pipelines, paving the way for mesh-based localization systems. Data, code, and interactive visualizations are available at https://meshvpr.github.io/
