PuzzleFusion++: Auto-agglomerative 3D Fracture Assembly by Denoise and Verify
Zhengqing Wang, Jiacheng Chen, Yasutaka Furukawa
TL;DR
PuzzleFusion++ addresses 3D fracture assembly by proposing an auto-agglomerative pipeline that iteratively aligns and merges fragments. It combines a diffusion-based $SE(3)$ denoiser with a transformer-based verifier to progressively form larger fragments across up to six iterations, emulating human puzzle-solving strategies. On the Breaking Bad dataset, it achieves state-of-the-art performance, with notable improvements in $PA$ and Chamfer distance, and demonstrates robustness through extensive ablations and qualitative analyses. The work offers a fully neural, end-to-end framework with potential impact on archaeology, forensics, and related fields requiring accurate 3D reconstruction from fragmented objects.
Abstract
This paper proposes a novel "auto-agglomerative" 3D fracture assembly method, PuzzleFusion++, resembling how humans solve challenging spatial puzzles. Starting from individual fragments, the approach 1) aligns and merges fragments into larger groups akin to agglomerative clustering and 2) repeats the process iteratively in completing the assembly akin to auto-regressive methods. Concretely, a diffusion model denoises the 6-DoF alignment parameters of the fragments simultaneously, and a transformer model verifies and merges pairwise alignments into larger ones, whose process repeats iteratively. Extensive experiments on the Breaking Bad dataset show that PuzzleFusion++ outperforms all other state-of-the-art techniques by significant margins across all metrics, in particular by over 10% in part accuracy and 50% in Chamfer distance. The code will be available on our project page: https://puzzlefusion-plusplus.github.io.
