GARF: Learning Generalizable 3D Reassembly for Real-World Fractures
Sihang Li, Zeyu Jiang, Grace Chen, Chenyang Xu, Siqi Tan, Xue Wang, Irving Fang, Kristof Zyskowski, Shannon P. McPherron, Radu Iovita, Chen Feng, Jing Zhang
TL;DR
This work addresses the challenge of generalizing 3D fracture reassembly from synthetic data to real-world fractures. It introduces GARF, a flow-based SE(3) framework augmented with fracture-aware pretraining, two-session inference, multi-anchor training, and LoRA fine-tuning, and validates it on Fractura, a diverse real/synthetic fracture dataset. GARF achieves state-of-the-art performance across synthetic and real-world scenarios, notably reducing rotation error and boosting part accuracy, while demonstrating robustness to missing or extraneous fragments. The Fractura dataset and GARF’s generalization capabilities offer a new benchmark and methodology for cross-domain 3D puzzle solving in archaeology, paleontology, and related fields.
Abstract
3D reassembly is a challenging spatial intelligence task with broad applications across scientific domains. While large-scale synthetic datasets have fueled promising learning-based approaches, their generalizability to different domains is limited. Critically, it remains uncertain whether models trained on synthetic datasets can generalize to real-world fractures where breakage patterns are more complex. To bridge this gap, we propose GARF, a generalizable 3D reassembly framework for real-world fractures. GARF leverages fracture-aware pretraining to learn fracture features from individual fragments, with flow matching enabling precise 6-DoF alignments. At inference time, we introduce one-step preassembly, improving robustness to unseen objects and varying numbers of fractures. In collaboration with archaeologists, paleoanthropologists, and ornithologists, we curate Fractura, a diverse dataset for vision and learning communities, featuring real-world fracture types across ceramics, bones, eggshells, and lithics. Comprehensive experiments have shown our approach consistently outperforms state-of-the-art methods on both synthetic and real-world datasets, achieving 82.87\% lower rotation error and 25.15\% higher part accuracy. This sheds light on training on synthetic data to advance real-world 3D puzzle solving, demonstrating its strong generalization across unseen object shapes and diverse fracture types. GARF's code, data and demo are available at https://ai4ce.github.io/GARF/.
