ESCAPE: Equivariant Shape Completion via Anchor Point Encoding
Burak Bekci, Nassir Navab, Federico Tombari, Mahdi Saleh
TL;DR
ESCAPE tackles rotation-robust 3D shape completion without pose estimation by introducing an anchor-point distance encoding. Each point is described by its distances to $k$ anchor points, forming a distance matrix $D_p \in \mathbb{R}^{n\times k}$ for a shape of $n$ points; an encoder-decoder transformer processes this matrix, and a Levenberg-Marquardt optimization then recovers coordinates from the predicted distances. The framework provides reconstruction uniqueness when $k \ge d+1$, where $d$ is the spatial dimension, and achieves constant error bounds for the distance-based representation, yielding rotation-equivariant outputs validated on PCN, OmniObject3D, and KITTI cars. Empirical results show robust, high-fidelity completions under arbitrary rotations and partial observations, outperforming canonical-alignment baselines and enabling deployment in dynamic environments without additional pose-estimation modules.
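To make the encoding concrete, the following minimal sketch builds an $n \times k$ anchor-distance matrix with NumPy and checks that it is unchanged when the shape and its anchors are rotated and translated together. The function names are hypothetical, and picking the first $k$ points as anchors is an assumption made only for illustration; the anchor selection rule is not described in this section.

```python
import numpy as np


def anchor_distance_encoding(points, anchors):
    """Return D of shape (n, k) with D[i, j] = ||points[i] - anchors[j]||."""
    diff = points[:, None, :] - anchors[None, :, :]
    return np.linalg.norm(diff, axis=-1)


def random_rotation(rng):
    """Sample a proper 3D rotation (det = +1) from a QR decomposition."""
    q, r = np.linalg.qr(rng.normal(size=(3, 3)))
    q = q * np.sign(np.diag(r))  # make the factorization sign-consistent
    if np.linalg.det(q) < 0:     # flip one axis to turn a reflection into a rotation
        q[:, 0] = -q[:, 0]
    return q


rng = np.random.default_rng(0)
points = rng.normal(size=(128, 3))  # toy point cloud, n = 128
k = 8
anchors = points[:k]                # assumption: first k points act as anchors

R = random_rotation(rng)
t = rng.normal(size=3)
points_rt = points @ R.T + t        # rigidly transformed shape
anchors_rt = anchors @ R.T + t      # anchors move with the shape

D = anchor_distance_encoding(points, anchors)
D_rt = anchor_distance_encoding(points_rt, anchors_rt)

print(D.shape)
print(np.allclose(D, D_rt))
```

Because rigid motions preserve Euclidean distances, the two matrices agree up to floating-point error, so a network that consumes only this matrix never sees the pose of the input.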
Abstract
Shape completion, a crucial task in 3D computer vision, involves predicting and filling the missing regions of scanned or partially observed objects. Current methods expect a known pose or canonical coordinates and do not perform well under varying rotations, limiting their real-world applicability. We introduce ESCAPE (Equivariant Shape Completion via Anchor Point Encoding), a novel framework designed to achieve rotation-equivariant shape completion. Our approach employs a distinctive encoding strategy: it selects anchor points from a shape and represents every point by its distances to all anchor points. This enables the model to capture a consistent, rotation-equivariant understanding of the object's geometry. ESCAPE leverages a transformer architecture to encode and decode the distance transformations, ensuring that generated shape completions remain accurate and equivariant under rotational transformations. We then recover the predicted shape's coordinates from the encodings by optimization. Experimental evaluations demonstrate that ESCAPE achieves robust, high-quality reconstructions across arbitrary rotations and translations, showcasing its effectiveness in real-world applications without additional pose estimation modules.
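The final step, turning distances back into coordinates, can be sketched as a small nonlinear least-squares problem. The code below is a hand-written, NumPy-only damped Gauss-Newton (Levenberg-Marquardt style) loop, not the authors' implementation. It assumes the anchor coordinates are known and that there are at least four anchors not lying in a common plane, and it starts from a closed-form linear estimate. The names `linear_init` and `refine_lm`, the damping schedule, and the noise level are all illustrative assumptions.

```python
import numpy as np


def linear_init(anchors, dists):
    """Closed-form starting guess: subtract the first range equation from the rest."""
    a0, d0 = anchors[0], dists[0]
    A = 2.0 * (anchors[1:] - a0)
    b = (np.sum(anchors[1:] ** 2, axis=1) - np.sum(a0 ** 2)
         - dists[1:] ** 2 + d0 ** 2)
    x, *_ = np.linalg.lstsq(A, b, rcond=None)
    return x


def refine_lm(anchors, dists, x0, n_iter=100, lam=1e-3):
    """Minimize sum_j (||x - a_j|| - d_j)^2 with a simple damped update."""
    def residual(x):
        return np.linalg.norm(x - anchors, axis=1) - dists

    x = np.asarray(x0, dtype=float).copy()
    r = residual(x)
    for _ in range(n_iter):
        diff = x - anchors
        norms = np.maximum(np.linalg.norm(diff, axis=1, keepdims=True), 1e-12)
        J = diff / norms  # Jacobian of the residuals, shape (k, 3)
        step = np.linalg.solve(J.T @ J + lam * np.eye(x.size), -J.T @ r)
        r_new = residual(x + step)
        if r_new @ r_new < r @ r:
            x, r, lam = x + step, r_new, lam * 0.5  # accept step, relax damping
        else:
            lam *= 10.0                             # reject step, increase damping
        if np.linalg.norm(step) < 1e-12:
            break
    return x


rng = np.random.default_rng(0)
anchors = rng.normal(size=(8, 3))  # k = 8 anchors with known coordinates
x_true = rng.normal(size=3)
dists = np.linalg.norm(x_true - anchors, axis=1)
noisy = dists + 1e-3 * rng.normal(size=dists.shape)  # stand-in for prediction error

x_hat = refine_lm(anchors, noisy, linear_init(anchors, noisy))
err = np.linalg.norm(x_hat - x_true)
print(err)
```

The same routine would be applied to each row of a predicted distance matrix. With exact distances the linear estimate is already the solution, and the damped refinement matters mainly when the predicted distances are noisy.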
