DM3D: Deformable Mamba via Offset-Guided Gaussian Sequencing for Point Cloud Understanding
Bin Liu, Chunyang Wang, Xuelian Liu
TL;DR
This work tackles point-cloud understanding with long-sequence modeling by learning a structure-aware serialization for State Space Models. DM3D introduces an offset-guided deformable scanning mechanism that unifies local Gaussian resampling and global differentiable reordering, via Gaussian-based KNN Resampling (GKR), Gaussian-based Differentiable Reordering (GDR), and LCFA, within a Deformable Mamba Block. A Tri-Path Frequency Fusion module reconciles information across three SSM paths and mitigates aliasing through frequency-domain processing. Across ModelNet40, ScanObjectNN, and ShapeNetPart, DM3D achieves state-of-the-art performance in classification, few-shot learning, and part segmentation, while enabling end-to-end training and providing code release.
Abstract
State Space Models (SSMs) demonstrate significant potential for long-sequence modeling, but their reliance on input order conflicts with the irregular nature of point clouds. Existing approaches often rely on predefined serialization strategies, which cannot adjust based on diverse geometric structures. To overcome this limitation, we propose \textbf{DM3D}, a deformable Mamba architecture for point cloud understanding. Specifically, DM3D introduces an offset-guided Gaussian sequencing mechanism that unifies local resampling and global reordering within a deformable scan. The Gaussian-based KNN Resampling (GKR) enhances structural awareness by adaptively reorganizing neighboring points, while the Gaussian-based Differentiable Reordering (GDR) enables end-to-end optimization of serialization order. Furthermore, a Tri-Path Frequency Fusion module enhances feature complementarity and reduces aliasing. Together, these components enable structure-adaptive serialization of point clouds. Extensive experiments on benchmark datasets show that DM3D achieves state-of-the-art performance in classification, few-shot learning, and part segmentation, demonstrating that adaptive serialization effectively unlocks the potential of SSMs for point cloud understanding. The code will be released at https://github.com/L1277471578/DM3D.
