DINO-RotateMatch: A Rotation-Aware Deep Framework for Robust Image Matching in Large-Scale 3D Reconstruction
Kaichen Zhang, Tianxiang Sheng, Xuanming Shi
TL;DR
The paper tackles robust image matching for large-scale 3D reconstruction from unstructured Internet images by introducing DINO-RotateMatch, a rotation-aware, self-supervised framework. It combines DINO-based image pairing with rotation-enhanced keypoint extraction (ALIKED) and rotation-aware matching (LightGlue), followed by COLMAP for 3D reconstruction. Key contributions include a dual-path image pairing strategy for small vs. large datasets and an explicit rotation augmentation pipeline that increases correspondences and robustness to viewpoint changes, demonstrated on the Kaggle Image Matching Challenge 2025 with a Silver Award. The findings show substantial improvements in mean Average Accuracy over strong baselines, highlighting the method’s scalability and robustness for real-world large-scale reconstructions from diverse image collections.
Abstract
This paper presents DINO-RotateMatch, a deep-learning framework designed to address the chal lenges of image matching in large-scale 3D reconstruction from unstructured Internet images. The method integrates a dataset-adaptive image pairing strategy with rotation-aware keypoint extraction and matching. DINO is employed to retrieve semantically relevant image pairs in large collections, while rotation-based augmentation captures orientation-dependent local features using ALIKED and Light Glue. Experiments on the Kaggle Image Matching Challenge 2025 demonstrate consistent improve ments in mean Average Accuracy (mAA), achieving a Silver Award (47th of 943 teams). The results confirm that combining self-supervised global descriptors with rotation-enhanced local matching offers a robust and scalable solution for large-scale 3D reconstruction.
