Deep Part Induction from Articulated Object Pairs

Li Yi; Haibin Huang; Difan Liu; Evangelos Kalogerakis; Hao Su; Leonidas Guibas

TL;DR

This work tackles mobility-based part induction from pairs of articulated objects. It proposes a class-agnostic, data-driven pipeline that jointly learns point correspondences, 3D deformation flows, and a segmentation of the moving parts. The approach combines three neural modules: a Correspondence Proposal module, a Flow module (PairNet), and a Segmentation module (RPEN). Their predictions are refined iteratively in an ICP-like loop, which reveals piecewise rigid structure despite geometric differences between the two shapes and noisy data. The network is trained on a large synthetic dataset with ground-truth part correspondences and motions, and is reported to outperform state-of-the-art baselines on both synthetic and real datasets, with strong generalization to unseen object categories. The framework enables applications in shape animation and shape–image joint analysis, offering a robust tool for functional understanding of articulated objects.

Abstract

Object functionality is often expressed through part articulation, as when the two rigid parts of a pair of scissors pivot against each other to perform the cutting function. Such articulations are often similar across objects within the same functional category. In this paper, we explore how the observation of different articulation states provides evidence for the part structure and motion of 3D objects. Our method takes as input a pair of unsegmented shapes representing two different articulation states of two functionally related objects, and induces their common parts along with their underlying rigid motion. This is a challenging setting: we assume no prior shape structure, no prior shape category information, and no consistent shape orientation; the articulation states may belong to objects of different geometry; and the inputs may be noisy and partial scans, or point clouds lifted from RGB images. Our method learns a neural network architecture with three modules that respectively propose correspondences, estimate 3D deformation flows, and perform segmentation. To achieve the best performance, our architecture alternates between correspondence, deformation flow, and segmentation prediction iteratively in an ICP-like fashion. Our results demonstrate that our method significantly outperforms state-of-the-art techniques in the task of discovering articulated parts of objects. In addition, our part induction is object-class agnostic and successfully generalizes to new and unseen objects.
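
To make the ICP-like alternation concrete, the following is a minimal classical sketch and not the paper's network. It assumes nearest-neighbor matching in place of the learned correspondence module, a per-part least-squares rigid fit (the Kabsch algorithm) in place of the learned flow module, and a reassignment of each point to the best-fitting motion in place of the learned segmentation module. The names `fit_rigid` and `induce_parts`, the fixed number of parts, and the need for an initial labeling are all assumptions made for illustration.

```python
import numpy as np

def fit_rigid(src, dst):
    """Kabsch: least-squares rotation R and translation t with dst ~= src @ R.T + t."""
    src_c, dst_c = src.mean(axis=0), dst.mean(axis=0)
    H = (src - src_c).T @ (dst - dst_c)           # 3x3 cross-covariance
    U, _, Vt = np.linalg.svd(H)
    d = 1.0 if np.linalg.det(Vt.T @ U.T) >= 0 else -1.0  # avoid reflections
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    return R, dst_c - R @ src_c

def induce_parts(src, dst, init_labels, n_parts=2, n_iters=5):
    """Alternate correspondence, flow, and segmentation (hypothetical stand-in)."""
    labels = np.asarray(init_labels).copy()
    motions = [(np.eye(3), np.zeros(3)) for _ in range(n_parts)]
    moved = src.copy()
    idx = np.arange(len(src))
    for _ in range(n_iters):
        # 1. Correspondence: match each currently deformed point to its
        #    nearest neighbor in the target shape.
        d2 = ((moved[:, None, :] - dst[None, :, :]) ** 2).sum(axis=-1)
        target = dst[d2.argmin(axis=1)]
        # 2. Flow: the implied per-point flow is target - src; here it is
        #    summarized by one rigid motion per part.
        for k in range(n_parts):
            mask = labels == k
            if mask.sum() >= 3:                   # need 3+ points for a stable fit
                motions[k] = fit_rigid(src[mask], target[mask])
        # 3. Segmentation: assign each point to the motion that explains it best.
        candidates = np.stack([src @ R.T + t for R, t in motions], axis=1)
        residual = np.linalg.norm(candidates - target[:, None, :], axis=2)
        labels = residual.argmin(axis=1)
        moved = candidates[idx, labels]
    return labels, motions
```

The sketch only behaves well when the motion between the two states is small relative to the point spacing and the initial labeling is roughly correct. The learned modules described in the abstract are aimed at settings where such assumptions fail, for example shapes with different geometry or inconsistent orientation.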
