CurveCloudNet: Processing Point Clouds with 1D Structure
Colton Stearns, Davis Rempe, Jiateng Liu, Alex Fu, Sebastien Mascha, Jeong Joon Park, Despoina Paschalidou, Leonidas J. Guibas
TL;DR
CurveCloudNet tackles semantic segmentation and object understanding in 3D point clouds by exploiting the 1D curve structure inherent in laser scans. It introduces a curve cloud representation—polylines derived from per-beam lidar sweeps—and a set of 1D curve operations (1D FPS, curve grouping, symmetric curve convolution) that are interleaved with traditional point-based processing in a U-Net–style backbone. The approach demonstrates strong, scalable performance across ShapeNet, KortX, A2D2, nuScenes, and KITTI, outperforming or matching state-of-the-art point-based and voxel-based methods while better handling large outdoor scenes and diverse scanning patterns. Together, the curve-centric paradigm provides a flexible, efficient backbone for 3D perception in open-world robotics, with clear avenues for extending curve reasoning to broader data types and future integrations such as curve-to-curve attention and cross-curve interactions.
Abstract
Modern depth sensors such as LiDAR operate by sweeping laser-beams across the scene, resulting in a point cloud with notable 1D curve-like structures. In this work, we introduce a new point cloud processing scheme and backbone, called CurveCloudNet, which takes advantage of the curve-like structure inherent to these sensors. While existing backbones discard the rich 1D traversal patterns and rely on generic 3D operations, CurveCloudNet parameterizes the point cloud as a collection of polylines (dubbed a "curve cloud"), establishing a local surface-aware ordering on the points. By reasoning along curves, CurveCloudNet captures lightweight curve-aware priors to efficiently and accurately reason in several diverse 3D environments. We evaluate CurveCloudNet on multiple synthetic and real datasets that exhibit distinct 3D size and structure. We demonstrate that CurveCloudNet outperforms both point-based and sparse-voxel backbones in various segmentation settings, notably scaling to large scenes better than point-based alternatives while exhibiting improved single-object performance over sparse-voxel alternatives. In all, CurveCloudNet is an efficient and accurate backbone that can handle a larger variety of 3D environments than past works.
