ABO: Dataset and Benchmarks for Real-World 3D Object Understanding
Jasmine Collins, Shubham Goel, Kenan Deng, Achleshwar Luthra, Leon Xu, Erhan Gundogdu, Xi Zhang, Tomas F. Yago Vicente, Thomas Dideriksen, Himanshu Arora, Matthieu Guillaumin, Jitendra Malik
TL;DR
The paper introduces ABO, a large-scale dataset linking real-world product imagery with high-quality artist-created 3D meshes and physically-based materials to benchmark real-world 3D understanding. It evaluates three tasks—single-view 3D reconstruction, material estimation, and multi-view cross-domain retrieval—revealing significant domain gaps when transferring from synthetic datasets and demonstrating the value of multi-view data for SV-BRDF estimation. Key contributions include a comprehensive ABO data release with rich metadata, automated 6-DOF pose annotations, a photorealistic material dataset, and a challenging MVR benchmark with geometry-aware evaluation. The results highlight the limitations of ShapeNet-trained models on real-world objects and establish ABO as a catalyst for more realistic 3D object understanding research with practical implications for vision, rendering, and robotics.
Abstract
We introduce Amazon Berkeley Objects (ABO), a new large-scale dataset designed to help bridge the gap between real and virtual 3D worlds. ABO contains product catalog images, metadata, and artist-created 3D models with complex geometries and physically-based materials that correspond to real, household objects. We derive challenging benchmarks that exploit the unique properties of ABO and measure the current limits of the state-of-the-art on three open problems for real-world 3D object understanding: single-view 3D reconstruction, material estimation, and cross-domain multi-view object retrieval.
