LAHNet: Local Attentive Hashing Network for Point Cloud Registration
Wentao Qu, Xiaoshui Huang, Liang Xiao
TL;DR
LAHNet addresses the need for broader receptive fields in point cloud registration descriptors by introducing locality-biased local attention through a Group Transformer that uses locality-sensitive hashing to partition points into non-overlapping windows and a cross-window interaction mechanism. An additional Interaction Transformer leverages an overlap matrix to enhance matching of overlap regions between paired clouds. The approach achieves state-of-the-art results on 3DMatch, 3DLoMatch, and KITTI, validating its robustness in both high- and low-overlap and outdoor scenarios. Overall, the paper presents a scalable, efficient framework that effectively models long-range dependencies in 3D data for improved registration accuracy.
Abstract
Most existing learning-based point cloud descriptors for point cloud registration focus on perceiving local information of point clouds to generate distinctive features. However, a reasonable and broader receptive field is essential for enhancing feature distinctiveness. In this paper, we propose a Local Attentive Hashing Network for point cloud registration, called LAHNet, which introduces a local attention mechanism with the inductive bias of locality of convolution-like operators into point cloud descriptors. Specifically, a Group Transformer is designed to capture reasonable long-range context between points. This employs a linear neighborhood search strategy, Locality-Sensitive Hashing, enabling uniformly partitioning point clouds into non-overlapping windows. Meanwhile, an efficient cross-window strategy is adopted to further expand the reasonable feature receptive field. Furthermore, building on this effective windowing strategy, we propose an Interaction Transformer to enhance the feature interactions of the overlap regions within point cloud pairs. This computes an overlap matrix to match overlap regions between point cloud pairs by representing each window as a global signal. Extensive results demonstrate that LAHNet can learn robust and distinctive features, achieving significant registration results on real-world indoor and outdoor benchmarks.
