Alignment Scores: Robust Metrics for Multiview Pose Accuracy Evaluation
Seong Hun Lee, Javier Civera
TL;DR
This work introduces three robust, decoupled metrics for multiview pose evaluation: TAS for translation accuracy, RAS for rotation accuracy, and PAS as their mean to capture full 6-DOF pose quality. Each metric relies on robust alignment to ground truth, followed by construction of cumulative frequency histograms over carefully chosen thresholds—$d$ sets the translation thresholds via $d=\underset{i}{\mathrm{Q3}}\left(\min_{j\neq i}\|\mathbf{c}_i-\mathbf{c}_j\|\right)$ and rotation thresholds span $0.1^\circ$ to $10^\circ$—with TAS and RAS defined as $\mathrm{TAS}=\frac{1}{100n}\sum f_k$ and $\mathrm{RAS}=\frac{1}{100n}\sum f_k$. By decoupling translation and rotation and using robust registration (PCR-99 variant) as well as robust rotation averaging (sra2), the authors demonstrate that TAS and RAS offer superior robustness to outliers and collinear motion and are insensitive to trajectory length, while PAS provides a stable, single-score summary. Extensive simulations compare these metrics against ATE, DTE, and mAA, highlighting limitations of existing measures and showing the practical advantages of the proposed approach. The work concludes with usage guidelines and discusses limitations related to heuristic thresholds and aggregation choices, underscoring the metrics’ potential to improve reproducibility and interpretability in multiview pose evaluation.
Abstract
We propose three novel metrics for evaluating the accuracy of a set of estimated camera poses given the ground truth: Translation Alignment Score (TAS), Rotation Alignment Score (RAS), and Pose Alignment Score (PAS). The TAS evaluates the translation accuracy independently of the rotations, and the RAS evaluates the rotation accuracy independently of the translations. The PAS is the average of the two scores, evaluating the combined accuracy of both translations and rotations. The TAS is computed in four steps: (1) Find the upper quartile of the closest-pair-distances, $d$. (2) Align the estimated trajectory to the ground truth using a robust registration method. (3) Collect all distance errors and obtain the cumulative frequencies for multiple thresholds ranging from $0.01d$ to $d$ with a resolution $0.01d$. (4) Add up these cumulative frequencies and normalize them such that the theoretical maximum is 1. The TAS has practical advantages over the existing metrics in that (1) it is robust to outliers and collinear motion, and (2) there is no need to adjust parameters on different datasets. The RAS is computed in a similar manner to the TAS and is also shown to be more robust against outliers than the existing rotation metrics. We verify our claims through extensive simulations and provide in-depth discussion of the strengths and weaknesses of the proposed metrics.
