When Distances Lie: Euclidean Embeddings in the Presence of Outliers and Distance Violations
Matthias Bentert, Fedor V. Fomin, Petr A. Golovach, M. S. Ramanujan, Saket Saurabh
TL;DR
This work studies Euclidean Embedding Editing, the problem of editing a distance space on $n$ points with up to $k_O$ outlier deletions and $k_M$ distance modifications to obtain a space embeddable in $\mathbb{R}^d$. It develops a polynomial-time compression to a kernel of size $|X'|=\mathcal{O}((kd)^2)$, and uses this kernel to devise an FPT algorithm parameterized by $(k_O+k_M)$ and $d$, via a reduction to a bounded set of existential sentences solved by algebraic decision procedures. It also delivers specialized, tight parameterized algorithms for the outlier-only case, including a $\min\{(d+3)^{k_O}, 2^{d+k_O}\}$-time method and a randomized $2$-approximation in $2^d\cdot n^{\mathcal{O}(1)}$, plus a 2-approximation for the general unweighted version. The paper further establishes NP-hardness and W[1]-hardness results, clarifying the limits of fixed-parameter tractability for these problems and justifying the combined parameterization by $d$ and the solution size. Overall, the results advance robust Euclidean embeddings under outliers and distance violations with provable guarantees and practical algorithms.
Abstract
Distance geometry explores the properties of distance spaces that can be exactly represented as the pairwise Euclidean distances between points in $\mathbb{R}^d$ ($d \geq 1$), or equivalently, distance spaces that can be isometrically embedded in $\mathbb{R}^d$. In this work, we investigate whether a distance space can be isometrically embedded in $\mathbb{R}^d$ after applying a limited number of modifications. Specifically, we focus on two types of modifications: outlier deletion (removing points) and distance modification (adjusting distances between points). The central problem, Euclidean Embedding Editing (EEE), asks whether an input distance space on $n$ points can be transformed, using at most $k$ modifications, into a space that is isometrically embeddable in $\mathbb{R}^d$. We present several fixed-parameter tractable (FPT) and approximation algorithms for this problem. Our first result is an algorithm that solves EEE in time $(dk)^{\mathcal{O}(d+k)} + n^{\mathcal{O}(1)}$. The core subroutine of this algorithm, which is of independent interest, is a polynomial-time method for compressing the input distance space into an equivalent instance of EEE with $\mathcal{O}((dk)^2)$ points. For the special but important case of EEE where only outlier deletions are allowed, we improve the parameter dependence of the FPT algorithm and obtain a running time of $\min\{(d+3)^k, 2^{d+k}\} \cdot n^{\mathcal{O}(1)}$. Additionally, we provide an FPT-approximation algorithm for this problem, which outputs a set of at most $2 \cdot {\rm OPT}$ outliers in time $2^d \cdot n^{\mathcal{O}(1)}$. This 2-approximation algorithm improves upon the previous $(3+\varepsilon)$-approximation algorithm by Sidiropoulos, Wang, and Wang [SODA '17]. Furthermore, we complement our algorithms with hardness results motivating our choice of parameterizations.
