Self-Parameterization Based Multi-Resolution Mesh Convolution Networks
Shi Hezi, Jiang Luo, Zheng Jianmin, Zeng Jun
TL;DR
This work introduces SPMM-Net, a self-parameterization based multi-resolution mesh convolution framework that extends image-style dense prediction to irregular 3D meshes. It constructs a multi-resolution mesh pyramid via bijective surface-to-surface mappings and employs area-aware pooling, barycentric upsampling, and face-convolution within a HRNet-inspired architecture to maintain high-resolution representations across stages. The approach achieves state-of-the-art performance on mesh segmentation and shape correspondence benchmarks, validating the effectiveness of preserving high-resolution information and cross-resolution fusion for dense mesh predictions. The work enables more accurate mesh analysis in geometric modeling while highlighting practical limitations related to mesh topology, parameterization quality, and certain CNN operations not supported.
Abstract
This paper addresses the challenges of designing mesh convolution neural networks for 3D mesh dense prediction. While deep learning has achieved remarkable success in image dense prediction tasks, directly applying or extending these methods to irregular graph data, such as 3D surface meshes, is nontrivial due to the non-uniform element distribution and irregular connectivity in surface meshes which make it difficult to adapt downsampling, upsampling, and convolution operations. In addition, commonly used multiresolution networks require repeated high-to-low and then low-to-high processes to boost the performance of recovering rich, high-resolution representations. To address these challenges, this paper proposes a self-parameterization-based multi-resolution convolution network that extends existing image dense prediction architectures to 3D meshes. The novelty of our approach lies in two key aspects. First, we construct a multi-resolution mesh pyramid directly from the high-resolution input data and propose area-aware mesh downsampling/upsampling operations that use sequential bijective inter-surface mappings between different mesh resolutions. The inter-surface mapping redefines the mesh, rather than reshaping it, which thus avoids introducing unnecessary errors. Second, we maintain the high-resolution representation in the multi-resolution convolution network, enabling multi-scale fusions to exchange information across parallel multi-resolution subnetworks, rather than through connections of high-to-low resolution subnetworks in series. These features differentiate our approach from most existing mesh convolution networks and enable more accurate mesh dense predictions, which is confirmed in experiments.
