A Subspace-Constrained Tyler's Estimator and its Applications to Structure from Motion
Feng Yu, Teng Zhang, Gilad Lerman
TL;DR
The paper tackles robust subspace recovery in the presence of heavy outliers by proposing the Subspace-Constrained Tyler's Estimator (STE), a method that regularizes Tyler's M-estimator to concentrate energy on a chosen $d$-subspace. STE blends Tyler's M-estimator with a variant of the fast median subspace, enabling reliable subspace recovery with fewer inliers and improved computational efficiency. The authors provide theoretical guarantees under a generalized haystack model, showing STE can succeed with DS-SNR below 1 given good initialization, and they validate the method on Structure from Motion tasks, including robust fundamental matrix estimation and initial camera removal, where STE demonstrates competitive or superior performance to state-of-the-art RSR methods. The results suggest STE is a practical and principled tool for robust 3D reconstruction pipelines, offering improvements in accuracy and speed and opening avenues for further integration and analysis in robust vision tasks.
Abstract
We present the subspace-constrained Tyler's estimator (STE) designed for recovering a low-dimensional subspace within a dataset that may be highly corrupted with outliers. STE is a fusion of the Tyler's M-estimator (TME) and a variant of the fast median subspace. Our theoretical analysis suggests that, under a common inlier-outlier model, STE can effectively recover the underlying subspace, even when it contains a smaller fraction of inliers relative to other methods in the field of robust subspace recovery. We apply STE in the context of Structure from Motion (SfM) in two ways: for robust estimation of the fundamental matrix and for the removal of outlying cameras, enhancing the robustness of the SfM pipeline. Numerical experiments confirm the state-of-the-art performance of our method in these applications. This research makes significant contributions to the field of robust subspace recovery, particularly in the context of computer vision and 3D reconstruction.
