EDM: Equirectangular Projection-Oriented Dense Kernelized Feature Matching
Dongki Jung, Jaehoon Choi, Yonghan Lee, Somi Jeong, Taejae Lee, Dinesh Manocha, Suyong Yeon
TL;DR
EDM addresses the challenge of dense feature matching for omnidirectional ERP images where projection distortions hinder traditional perspective methods. It introduces a distortion-aware dense matcher that operates in a unified spherical framework using a Spherical Spatial Alignment Module and Geodesic Flow Refinement, aided by spherical positional embeddings and bidirectional spherical-cartesian transformations. The method achieves state-of-the-art accuracy on Matterport3D and Stanford2D3D, with $AUC@5deg$ gains of +26.72 and +42.62, and qualitative robustness on EgoNeRF and OmniPhotos. By optimizing angular agreement on the unit sphere and promoting distortion-aware representations, EDM advances practical localization and mapping for omnidirectional imagery. Limitations include indoor, gravity-aligned data bias; future work expands data diversity and downstream tasks such as localization and mapping for omnidirectional imagery.
Abstract
We introduce the first learning-based dense matching algorithm, termed Equirectangular Projection-Oriented Dense Kernelized Feature Matching (EDM), specifically designed for omnidirectional images. Equirectangular projection (ERP) images, with their large fields of view, are particularly suited for dense matching techniques that aim to establish comprehensive correspondences across images. However, ERP images are subject to significant distortions, which we address by leveraging the spherical camera model and geodesic flow refinement in the dense matching method. To further mitigate these distortions, we propose spherical positional embeddings based on 3D Cartesian coordinates of the feature grid. Additionally, our method incorporates bidirectional transformations between spherical and Cartesian coordinate systems during refinement, utilizing a unit sphere to improve matching performance. We demonstrate that our proposed method achieves notable performance enhancements, with improvements of +26.72 and +42.62 in AUC@5° on the Matterport3D and Stanford2D3D datasets.
