A dynamic memory assignment strategy for dilation-based ICP algorithm on embedded GPUs
Qiong Chang, Weimin Wang, Junpei Zhong, Jun Miyazaki
TL;DR
This work tackles the memory bottleneck of VANICP when running on resource-limited embedded GPUs by introducing a GPU-oriented dynamic memory assignment strategy for voxel-based dilation in ICP. The method uses a GPU voxel occupancy histogram and CPU-side address offsets with indirect addressing across three memory spaces to drastically reduce memory footprint while maintaining VANICP's performance. Experimental results show memory reductions over 97% and favorable energy efficiency across large point clouds on embedded platforms, with publication of source code for reproducibility. The approach enables real-time, memory-efficient point-cloud registration on edge devices, broadening the practical deployment of dilation-based ICP frameworks.
Abstract
This paper proposes a memory-efficient optimization strategy for the high-performance point cloud registration algorithm VANICP, enabling lightweight execution on embedded GPUs with constrained hardware resources. VANICP is a recently published acceleration framework that significantly improves the computational efficiency of point-cloud-based applications. By transforming the global nearest neighbor search into a localized process through a dilation-based information propagation mechanism, VANICP greatly reduces the computational complexity of the NNS. However, its original implementation demands a considerable amount of memory, which restricts its deployment in resource-constrained environments such as embedded systems. To address this issue, we propose a GPU-oriented dynamic memory assignment strategy that optimizes the memory usage of the dilation operation. Furthermore, based on this strategy, we construct an enhanced version of the VANICP framework that achieves over 97% reduction in memory consumption while preserving the original performance. Source code is published on: https://github.com/changqiong/VANICP4Em.git.
