KPConvX: Modernizing Kernel Point Convolution with Kernel Attention
Hugues Thomas, Yao-Hung Hubert Tsai, Timothy D. Barfoot, Jian Zhang
TL;DR
This work revisits kernel-point convolution by introducing KPConvD, a depthwise variant, and KPConvX, which adds geometric kernel attention. By combining these operators with modern architectural designs and training strategies, the method achieves state-of-the-art results on S3DIS, ScanNetv2, and ScanObjectNN, while maintaining efficiency through nearest-kernel operations and shell-based kernel point layouts. The ablations demonstrate substantial gains from depthwise design, attention modulation, and carefully chosen kernel shells and groupings. Overall, KPConvX offers a robust, scalable framework for 3D point-cloud understanding, bridging geometric kernels with attention-like modulation for improved accuracy and efficiency.
Abstract
In the field of deep point cloud understanding, KPConv is a unique architecture that uses kernel points to locate convolutional weights in space, instead of relying on Multi-Layer Perceptron (MLP) encodings. While it initially achieved success, it has since been surpassed by recent MLP networks that employ updated designs and training strategies. Building upon the kernel point principle, we present two novel designs: KPConvD (depthwise KPConv), a lighter design that enables the use of deeper architectures, and KPConvX, an innovative design that scales the depthwise convolutional weights of KPConvD with kernel attention values. Using KPConvX with a modern architecture and training strategy, we are able to outperform current state-of-the-art approaches on the ScanObjectNN, Scannetv2, and S3DIS datasets. We validate our design choices through ablation studies and release our code and models.
