PolyhedronNet: Representation Learning for Polyhedra with Surface-attributed Graph
Dazhou Yu, Genpei Zhang, Liang Zhao
TL;DR
PolyhedronNet tackles polyhedra representation learning by introducing the surface-attributed graph (SAG) to unify vertices, edges, and faces with semantic attributes. It then decomposes SAG into local rigid representations along two-hop paths to preserve geometric and semantic information while achieving rotation and translation invariance, and applies a specialized graph neural network, PolyhedronGNN, to hierarchically aggregate these local representations into a global SAG embedding. The framework is underpinned by invertibility and completeness notions for recovering the original polyhedron from SAG-derived representations, and demonstrates state-of-the-art performance on four datasets for classification and retrieval. This approach enables robust, surface-aware understanding of polyhedral shapes with potential impact on CAD, architectural design, and 3D object analysis by providing a principled, scalable representation learning pipeline.
Abstract
Ubiquitous geometric objects can be precisely and efficiently represented as polyhedra. The transformation of a polyhedron into a vector, known as polyhedra representation learning, is crucial for manipulating these shapes with mathematical and statistical tools for tasks like classification, clustering, and generation. Recent years have witnessed significant strides in this domain, yet most efforts focus on the vertex sequence of a polyhedron, neglecting the complex surface modeling crucial in real-world polyhedral objects. This study proposes \textbf{PolyhedronNet}, a general framework tailored for learning representations of 3D polyhedral objects. We propose the concept of the surface-attributed graph to seamlessly model the vertices, edges, faces, and their geometric interrelationships within a polyhedron. To effectively learn the representation of the entire surface-attributed graph, we first propose to break it down into local rigid representations to effectively learn each local region's relative positions against the remaining regions without geometric information loss. Subsequently, we propose PolyhedronGNN to hierarchically aggregate the local rigid representation via intra-face and inter-face geometric message passing modules, to obtain a global representation that minimizes information loss while maintaining rotation and translation invariance. Our experimental evaluations on four distinct datasets, encompassing both classification and retrieval tasks, substantiate PolyhedronNet's efficacy in capturing comprehensive and informative representations of 3D polyhedral objects. Code and data are available at {https://github.com/dyu62/3D_polyhedron}.
