CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers
Yi Rong, Haoran Zhou, Lixin Yuan, Cheng Mei, Jiahao Wang, Tong Lu
TL;DR
CRA-PCN addresses the challenge of incomplete point clouds by introducing Cross-Resolution Transformer (CRT) that performs explicit cross-resolution aggregation with local attention across multiple scales. CRT comes in inter-level and intra-level forms, sharing a unified implementation and is embedded into a three-block up-sampling decoder to form a coarse-to-fine CRA-PCN encoder–decoder. Extensive experiments on PCN, ShapeNet-55/34, and MVP show state-of-the-art performance and reveal the critical benefit of combining intra- and inter-level CRA along with recursive multi-scale design. The proposed approach provides a plug-in, efficient mechanism for learning rich multi-scale local geometry, with strong generalization and practical impact for 3D reconstruction tasks.
Abstract
Point cloud completion is an indispensable task for recovering complete point clouds due to incompleteness caused by occlusion, limited sensor resolution, etc. The family of coarse-to-fine generation architectures has recently exhibited great success in point cloud completion and gradually became mainstream. In this work, we unveil one of the key ingredients behind these methods: meticulously devised feature extraction operations with explicit cross-resolution aggregation. We present Cross-Resolution Transformer that efficiently performs cross-resolution aggregation with local attention mechanisms. With the help of our recursive designs, the proposed operation can capture more scales of features than common aggregation operations, which is beneficial for capturing fine geometric characteristics. While prior methodologies have ventured into various manifestations of inter-level cross-resolution aggregation, the effectiveness of intra-level one and their combination has not been analyzed. With unified designs, Cross-Resolution Transformer can perform intra- or inter-level cross-resolution aggregation by switching inputs. We integrate two forms of Cross-Resolution Transformers into one up-sampling block for point generation, and following the coarse-to-fine manner, we construct CRA-PCN to incrementally predict complete shapes with stacked up-sampling blocks. Extensive experiments demonstrate that our method outperforms state-of-the-art methods by a large margin on several widely used benchmarks. Codes are available at https://github.com/EasyRy/CRA-PCN.
