Inter3D: A Benchmark and Strong Baseline for Human-Interactive 3D Object Reconstruction

Gan Chen; Ying He; Mulin Yu; F. Richard Yu; Gang Xu; Fei Ma; Ming Li; Guang Zhou

TL;DR

Inter3D addresses the problem of modeling human-interactive objects with $n$ movable parts, which have $2^n$ discrete states and would otherwise require a separate model per state. It introduces a benchmark with a self-collected dataset and an evaluation protocol in which training is restricted to the canonical state and the individual-part states, while combination states are held out for testing. It also presents a baseline method that combines Space Discrepancy Tensors with the multi-resolution hash encoding of InstantNGP. The approach uses Mutual State Regularization to maintain cross-state consistency and offers two occupancy-grid strategies that trade off training speed against memory usage. Experiments on four object categories show strong performance on novel state synthesis and clear advantages over existing static and dynamic 3D methods. These contributions provide a practical foundation for scalable, interactive 3D object reconstruction and synthesis.

Abstract

Recent advancements in implicit 3D reconstruction methods, e.g., neural rendering fields and Gaussian splatting, have primarily focused on novel view synthesis of static or dynamic objects with continuous motion states. However, these approaches struggle to efficiently model a human-interactive object with $n$ movable parts, since they require $2^n$ separate models to represent all discrete states. To overcome this limitation, we propose Inter3D, a new benchmark and approach for novel state synthesis of human-interactive objects. We introduce a self-collected dataset featuring commonly encountered interactive objects and a new evaluation pipeline, where only individual part states are observed during training, while part combination states remain unseen. We also propose a strong baseline approach that leverages Space Discrepancy Tensors to efficiently model all states of an object. To alleviate the impractical constraints on camera trajectories across training states, we propose a Mutual State Regularization mechanism to enhance the spatial density consistency of movable parts. In addition, we explore two occupancy grid sampling strategies to improve training efficiency. We conduct extensive experiments on the proposed benchmark, showcasing the challenges of the task and the superiority of our approach.
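
The split implied by this evaluation protocol can be illustrated with a minimal sketch. It assumes each part is binary (0 = canonical position, 1 = moved) and that the canonical state is included in training, as the TL;DR states. The function name `split_states` and the tuple encoding are illustrative choices, not taken from the paper.

```python
from itertools import product


def split_states(n_parts):
    """Enumerate all 2**n_parts states and split them into seen/unseen sets.

    A state is a tuple of 0/1 flags, one per movable part (assumed encoding).
    Training states: at most one part moved (canonical + individual-part states).
    Test states: two or more parts moved (combination states).
    """
    train, test = [], []
    for state in product((0, 1), repeat=n_parts):
        moved = sum(state)  # number of parts away from the canonical position
        if moved <= 1:
            train.append(state)
        else:
            test.append(state)
    return train, test


train_states, test_states = split_states(3)
print(len(train_states), len(test_states))  # 4 4
```

Under these assumptions, an object with $n$ parts yields $n + 1$ training states and $2^n - n - 1$ unseen combination states, so the share of held-out states grows quickly with $n$.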
