CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation
Hajin Shim, Changhun Kim, Eunho Yang
TL;DR
CloudFixer introduces a diffusion-guided, per-sample input adaptation for 3D point clouds to combat real-world noise and distribution shifts. By optimizing a geometry-based transformation y_phi(x) (rotations and per-point displacements) to align with a diffusion model trained on the source domain, and by avoiding backpropagation through the diffusion model, it achieves real-time adaptation. An optional online model adaptation (CloudFixer-O) further aligns predictions between original and adapted inputs, improving robustness under batch and stream conditions. Extensive experiments on ModelNet40-C and PointDA-10 across multiple backbones show state-of-the-art performance under challenging and mild conditions, with strong efficiency and broad architectural applicability. The approach demonstrates practical impact for on-device 3D perception, offering a principled, diffusion-informed pathway to reliable test-time adaptation in 3D vision.
Abstract
3D point clouds captured from real-world sensors frequently encompass noisy points due to various obstacles, such as occlusion, limited resolution, and variations in scale. These challenges hinder the deployment of pre-trained point cloud recognition models trained on clean point clouds, leading to significant performance degradation. While test-time adaptation (TTA) strategies have shown promising results on this issue in the 2D domain, their application to 3D point clouds remains under-explored. Among TTA methods, an input adaptation approach, which directly converts test instances to the source domain using a pre-trained diffusion model, has been proposed in the 2D domain. Despite its robust TTA performance in practical situations, naively adopting this into the 3D domain may be suboptimal due to the neglect of inherent properties of point clouds, and its prohibitive computational cost. Motivated by these limitations, we propose CloudFixer, a test-time input adaptation method tailored for 3D point clouds, employing a pre-trained diffusion model. Specifically, CloudFixer optimizes geometric transformation parameters with carefully designed objectives that leverage the geometric properties of point clouds. We also substantially improve computational efficiency by avoiding backpropagation through the diffusion model and a prohibitive generation process. Furthermore, we propose an online model adaptation strategy by aligning the original model prediction with that of the adapted input. Extensive experiments showcase the superiority of CloudFixer over various TTA baselines, excelling in handling common corruptions and natural distribution shifts across diverse real-world scenarios. Our code is available at https://github.com/shimazing/CloudFixer
