A Taxonomy of Structure from Motion Methods
Federica Arrigoni
TL;DR
This paper presents a conceptual taxonomy for Structure from Motion (SfM) methods by partitioning approaches into Structure and Motion (SAM), Structure from Motion (SFM), and Structure without Motion (SWM). It emphasizes graph-based formulations and analyzes theoretical conditions that ensure well-posedness, including degeneracies and solvability criteria for calibrated and uncalibrated cameras. The survey covers classic geometry-based pipelines and also surveys recent data-driven trends, including learning-based matrix completion and differentiable SfM components. It highlights the trade-offs between sequential/global strategies, initialization requirements, and the potential for combining theory with practice to handle degeneracies and large-scale data. The work aims to guide researchers toward principled SfM designs with clear assumptions and provable properties.
Abstract
Structure from Motion (SfM) refers to the problem of recovering both structure (i.e., 3D coordinates of points in the scene) and motion (i.e., camera matrices) starting from point correspondences in multiple images. It has attracted significant attention over the years, counting practical reconstruction pipelines as well as theoretical results. This paper is conceived as a conceptual review of SfM methods, which are grouped into three main categories, according to which part of the problem - between motion and structure - they focus on. The proposed taxonomy brings a new perspective on existing SfM approaches as well as insights into open problems and possible future research directions. Particular emphasis is given on identifying the theoretical conditions that make SfM well posed, which depend on the problem formulation that is being considered.
