DHGCN: Dynamic Hop Graph Convolution Network for Self-Supervised Point Cloud Learning
Jincen Jiang, Lizhi Zhao, Xuequan Lu, Wei Hu, Imran Razzak, Meili Wang
TL;DR
DHGCN tackles self-supervised learning for 3D point clouds by explicitly modeling contextual relationships between voxelized point parts through a self-supervised hop distance reconstruction task. It constructs a complete graph of parts, applies PartConvolution for part features, and uses a Hop Graph Attention mechanism that leverages learned hop distances (via a Gaussian kernel) to weight edge features during aggregation, with distances updated dynamically across layers. The approach yields a plug-and-play module compatible with common point-based backbones and demonstrates state-of-the-art unsupervised performance on downstream classification and shape part segmentation tasks, including robustness on real-world data. These results suggest that explicitly encoding part-level adjacency and distance information enhances representation learning for non-Euclidean 3D data, with practical impact for scalable 3D understanding without labels.
Abstract
Recent works attempt to extend Graph Convolution Networks (GCNs) to point clouds for classification and segmentation tasks. These works tend to sample and group points to create smaller point sets locally and mainly focus on extracting local features through GCNs, while ignoring the relationship between point sets. In this paper, we propose the Dynamic Hop Graph Convolution Network (DHGCN) for explicitly learning the contextual relationships between the voxelized point parts, which are treated as graph nodes. Motivated by the intuition that the contextual information between point parts lies in the pairwise adjacent relationship, which can be depicted by the hop distance of the graph quantitatively, we devise a novel self-supervised part-level hop distance reconstruction task and design a novel loss function accordingly to facilitate training. In addition, we propose the Hop Graph Attention (HGA), which takes the learned hop distance as input for producing attention weights to allow edge features to contribute distinctively in aggregation. Eventually, the proposed DHGCN is a plug-and-play module that is compatible with point-based backbone networks. Comprehensive experiments on different backbones and tasks demonstrate that our self-supervised method achieves state-of-the-art performance. Our source code is available at: https://github.com/Jinec98/DHGCN.
