EdgeGaussians -- 3D Edge Mapping via Gaussian Splatting
Kunal Chelani, Assia Benbihi, Torsten Sattler, Fredrik Kahl
TL;DR
EdgeGaussians replaces implicit neural edge fields with explicit oriented 3D edge points modeled as Gaussians, learning each point's position via its mean $\mu$ and its edge direction via the principal axis of covariance $\Sigma$ with $G(x) = \exp(-\tfrac{1}{2}(x-\mu)^T \Sigma^{-1}(x-\mu))$ and enforcing $\Sigma = R S S^{\top} R^{\top}$. Training uses rendering-based losses and two geometric regularizers: $\mathcal{L}_{\text{orient}} = 1 - \frac{1}{N} \left( \frac{1}{k} \sum_{i=1}^{N} \sum_{j=1}^{k} |d_i^{\top} d_{i_j}| \right)$ and $\mathcal{L}_{\text{shape}} = \frac{1}{N} \sum_{i=1}^{N} \frac{{}^{2}s_i}{{}^{1}s_i}$, while supervision comes from 2D edge maps via a masked projection loss $\mathcal{L}_{\text{proj}}$. The learned Gaussians are clustered into edges by a graph traversal that enforces spatial proximity and orientation consistency, then fitted with either lines or cubic Bézier curves, selected by comparing curve vs. line residuals with a threshold $\delta$. Empirically, EdgeGaussians matches or exceeds state-of-the-art accuracy and completeness while offering an order-of-magnitude faster training than prior implicit methods, demonstrating practical benefits for fast 3D edge mapping from multi-view data.
Abstract
With their meaningful geometry and their omnipresence in the 3D world, edges are extremely useful primitives in computer vision. 3D edges comprise of lines and curves, and methods to reconstruct them use either multi-view images or point clouds as input. State-of-the-art image-based methods first learn a 3D edge point cloud then fit 3D edges to it. The edge point cloud is obtained by learning a 3D neural implicit edge field from which the 3D edge points are sampled on a specific level set (0 or 1). However, such methods present two important drawbacks: i) it is not realistic to sample points on exact level sets due to float imprecision and training inaccuracies. Instead, they are sampled within a range of levels so the points do not lie accurately on the 3D edges and require further processing. ii) Such implicit representations are computationally expensive and require long training times. In this paper, we address these two limitations and propose a 3D edge mapping that is simpler, more efficient, and preserves accuracy. Our method learns explicitly the 3D edge points and their edge direction hence bypassing the need for point sampling. It casts a 3D edge point as the center of a 3D Gaussian and the edge direction as the principal axis of the Gaussian. Such a representation has the advantage of being not only geometrically meaningful but also compatible with the efficient training optimization defined in Gaussian Splatting. Results show that the proposed method produces edges as accurate and complete as the state-of-the-art while being an order of magnitude faster. Code is released at https://github.com/kunalchelani/EdgeGaussians.
