Self-Supervised Implicit Attention Priors for Point Cloud Reconstruction
Kyle Fogarty, Chenyue Cai, Jing Yang, Zhilin Guo, Cengiz Öztireli
TL;DR
The paper tackles the ill-posed problem of reconstructing continuous surfaces from irregular point clouds by learning a shape-specific implicit prior directly from the input. It introduces a dictionary-conditioned neural field whose queries attend to a learnable token dictionary via cross-attention, producing a non-local self-prior that guides surface prediction. After learning, the zero-level set is densified and normals are derived from gradients, followed by a Robust Implicit MLS refinement to yield high-fidelity meshes with preserved detail and flexible topology. The method achieves state-of-the-art performance on self-similar shapes and demonstrates robustness to noise and sparsity, while operating without external training data and enabling accurate normal estimation through the gradient of the learned field.
Abstract
Recovering high-quality surfaces from irregular point cloud is ill-posed unless strong geometric priors are available. We introduce an implicit self-prior approach that distills a shape-specific prior directly from the input point cloud itself and embeds it within an implicit neural representation. This is achieved by jointly training a small dictionary of learnable embeddings with an implicit distance field; at every query location, the field attends to the dictionary via cross-attention, enabling the network to capture and reuse repeating structures and long-range correlations inherent to the shape. Optimized solely with self-supervised point cloud reconstruction losses, our approach requires no external training data. To effectively integrate this learned prior while preserving input fidelity, the trained field is then sampled to extract densely distributed points and analytic normals via automatic differentiation. We integrate the resulting dense point cloud and corresponding normals into a robust implicit moving least squares (RIMLS) formulation. We show this hybrid strategy preserves fine geometric details in the input data, while leveraging the learned prior to regularize sparse regions. Experiments show that our method outperforms both classical and learning-based approaches in generating high-fidelity surfaces with superior detail preservation and robustness to common data degradations.
