Multi-View Pose-Agnostic Change Localization with Zero Labels
Chamuditha Jayanga Galappaththige, Jason Lai, Lloyd Windrim, Donald Dansereau, Niko Suenderhauf, Dimity Miller
TL;DR
This work tackles label-free, pose-agnostic change localization in unconstrained 3D environments by introducing Change-3DGS, a multi-view 3D Gaussian Splatting representation that explicitly encodes change at the 3D level. The approach builds a reference 3DGS from pre-change views, derives feature- and structure-aware per-view change cues (via DINOv2 features and SSIM), and learns per-Gaussian change channels to generate multi-view change masks for unseen viewpoints. It further uses data augmentation and initialization strategies to leverage prior scene structure, achieving state-of-the-art results on multi-object scenes and across challenging datasets (MAD-Real, ChangeSim, PASLCD), while introducing PASLCD as a real-world benchmark with lighting variations. The method demonstrates robust performance with limited inference views and provides a versatile, multi-view extension that can enhance existing change-detection pipelines by enforcing cross-view consistency in a 3D scene representation.
Abstract
Autonomous agents often require accurate methods for detecting and localizing changes in their environment, particularly when observations are captured from unconstrained and inconsistent viewpoints. We propose a novel label-free, pose-agnostic change detection method that integrates information from multiple viewpoints to construct a change-aware 3D Gaussian Splatting (3DGS) representation of the scene. With as few as 5 images of the post-change scene, our approach can learn an additional change channel in a 3DGS and produce change masks that outperform single-view techniques. Our change-aware 3D scene representation additionally enables the generation of accurate change masks for unseen viewpoints. Experimental results demonstrate state-of-the-art performance in complex multi-object scenes, achieving a 1.7x and 1.5x improvement in Mean Intersection Over Union and F1 score respectively over other baselines. We also contribute a new real-world dataset to benchmark change detection in diverse challenging scenes in the presence of lighting variations.
