A Dual Basis Approach for Structured Robust Euclidean Distance Geometry
Chandra Kundu, Abiy Tasissa, HanQin Cai
TL;DR
RoDEoDB addresses robust Euclidean Distance Geometry under structured anchor–target observations with sparse outliers by exploiting a non-orthogonal dual-basis mapping between the distance matrix and a low-rank Gram matrix. The method operates in two phases: first, a Dual Basis Alternating Projections (DBAP) step robustly recovers the Gram block from corrupted anchor–target data; second, Nyström reconstruction yields the full Gram matrix and the $d$-dimensional point configuration. Theoretical guarantees under $\mu$-incoherence and $\alpha$-sparsity show exact recovery of both the Gram matrix and the point set with high probability, and empirical results on synthetic and molecular datasets demonstrate superior robustness and accuracy compared to baselines, especially with limited anchors and higher corruption. This framework enables reliable localization and conformation tasks in sensor networks and molecular modeling where only anchor–target distances are available and noisy.
Abstract
Euclidean Distance Matrix (EDM), which consists of pairwise squared Euclidean distances of a given point configuration, finds many applications in modern machine learning. This paper considers the setting where only a set of anchor nodes is used to collect the distances between themselves and the rest. In the presence of potential outliers, it results in a structured partial observation on EDM with partial corruptions. Note that an EDM can be connected to a positive semi-definite Gram matrix via a non-orthogonal dual basis. Inspired by recent development of non-orthogonal dual basis in optimization, we propose a novel algorithmic framework, dubbed Robust Euclidean Distance Geometry via Dual Basis (RoDEoDB), for recovering the Euclidean distance geometry, i.e., the underlying point configuration. The exact recovery guarantees have been established in terms of both the Gram matrix and point configuration, under some mild conditions. Empirical experiments show superior performance of RoDEoDB on sensor localization and molecular conformation datasets.
