Test-time adaptation for geospatial point cloud semantic segmentation with distinct domain shifts

Puzuo Wang; Wei Yao; Jie Shao; Zhiyi He

TL;DR

This work tackles test-time adaptation (TTA) for geospatial point cloud semantic segmentation (PCSS) under three practical domain shifts by progressively updating batch normalization (BN) statistics and optimizing the BN affine parameters via self-supervision. The proposed framework combines progressive batch normalization (PBN) with information maximization and reliability-constrained pseudo-labeling to adapt a pre-trained model to unlabeled target data during inference, without access to source data. Across photogrammetric-to-ALS (airborne laser scanning), ALS-to-MLS (mobile laser scanning), and synthetic-to-MLS transfers, the method improves both mIoU and OA, with mIoU gains of up to ~20 percentage points. On the SensatUrban to Hessigheim 3D transfer it reaches $mIoU=59.46\%$ and $OA=85.97\%$. The results show that BN-centered adaptation is effective, robust to batch variations, and broadly applicable across backbones, providing a practical, privacy-preserving path for real-time domain adaptation in geospatial PCSS.

Abstract

Domain adaptation (DA) techniques help deep learning models generalize across data shifts for point cloud semantic segmentation (PCSS). Test-time adaptation (TTA) allows direct adaptation of a pre-trained model to unlabeled data during the inference stage without access to source data or additional training, avoiding privacy issues and large computational resources. We address TTA for geospatial PCSS by introducing three domain shift paradigms: photogrammetric to airborne LiDAR, airborne to mobile LiDAR, and synthetic to mobile laser scanning. We propose a TTA method that progressively updates batch normalization (BN) statistics with each testing batch. Additionally, a self-supervised learning module optimizes learnable BN affine parameters. Information maximization and reliability-constrained pseudo-labeling improve prediction confidence and supply supervisory signals. Experimental results show our method improves classification accuracy by up to 20\% mIoU, outperforming other methods. For photogrammetric (SensatUrban) to airborne (Hessigheim 3D) adaptation at the inference stage, our method achieves 59.46\% mIoU and 85.97\% OA without retraining or fine-tuning.
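The two ingredients named in the abstract, progressive BN statistic updates and information maximization, can be illustrated with a minimal sketch. This is not the paper's implementation: the exponential-moving-average rule, the `momentum` value, and the function names are assumptions made for illustration, and the information-maximization term follows the commonly used form (low per-point entropy, high entropy of the averaged prediction) rather than a formula taken from this page.

```python
import math


def progressive_bn_update(running_mean, running_var, batch, momentum=0.1):
    """Blend one test batch's per-channel statistics into running BN statistics.

    batch: list of feature vectors, one list of floats per point.
    Assumption: a simple exponential moving average; the paper's exact
    progressive update rule is not given in this summary.
    """
    n = len(batch)
    new_mean, new_var = [], []
    for c in range(len(running_mean)):
        values = [row[c] for row in batch]
        mean_c = sum(values) / n
        var_c = sum((v - mean_c) ** 2 for v in values) / n  # biased variance
        new_mean.append((1 - momentum) * running_mean[c] + momentum * mean_c)
        new_var.append((1 - momentum) * running_var[c] + momentum * var_c)
    return new_mean, new_var


def entropy(probs, eps=1e-12):
    """Shannon entropy of one probability vector (natural log)."""
    return -sum(p * math.log(p + eps) for p in probs)


def information_maximization_loss(prob_rows):
    """Mean per-point entropy minus the entropy of the mean prediction.

    Minimizing it favors confident predictions for each point while keeping
    the predicted class distribution diverse over the batch.
    """
    n = len(prob_rows)
    num_classes = len(prob_rows[0])
    mean_entropy = sum(entropy(p) for p in prob_rows) / n
    marginal = [sum(p[k] for p in prob_rows) / n for k in range(num_classes)]
    return mean_entropy - entropy(marginal)
```

In a full pipeline each incoming test batch would first refresh the BN statistics, and a loss of this kind would then drive gradient updates restricted to the BN affine parameters, as the abstract describes. The sketch stops short of that optimization step and of the pseudo-labeling term.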

Paper Structure (37 sections, 12 equations, 10 figures, 9 tables)

Introduction
Related works
Unsupervised domain adaptation for point cloud semantic segmentation
Test-time adaptation
Batch normalization
Self-supervised learning
Methodology
Overview
Progressive batch normalization
Revisiting batch normalization in deep learning
Progressive adaptation
Learnable parameter adaptation with self-supervised learning
Information maximization
Pseudo-labeling with reliability constraint
Experiment
...and 22 more sections

Figures (10)

Figure 1: Typical domain shifts in handling geospatial point clouds.
Figure 2: Illustration of the proposed method for test-time adaptation. Following the training of a deep model on labeled source data, the pre-trained model is adapted to the target data through modifications to the BN layers during inference, which includes updating statistical information progressively and optimizing learnable parameters in a self-supervised manner.
Figure 3: Comparison of different BN adaptation methods.
Figure 4: The reliability constrained pseudo-labeling strategy, which jointly considers entropy-based confidence and contrastive consistency.
Figure 5: Three real-world TTA benchmarks. Based on the category distribution among the datasets, evaluation merges classes sharing the same color, while black-colored categories are excluded from evaluation.
...and 5 more figures
