Single-image coherent reconstruction of objects and humans
Sarthak Batra, Partha P. Chakrabarti, Simon Hadfield, Armin Mustafa
TL;DR
This work tackles monocular 3D reconstruction of scenes with multiple interacting humans and objects by introducing an optimization-based framework that enforces global spatial coherence. It jointly reasons about human–human and human–object interactions through a collision loss, an occlusion-aware 6-DOF object pose estimation via image inpainting, and a differentiable rendering-based fitting of object exemplars, all without explicit 3D supervision. A final joint optimization yields globally consistent scene layouts with reduced mesh collisions and more realistic interactions, demonstrated on COCO-2017 against state-of-the-art methods. The approach advances single-image scene understanding by producing coherent, collision-free reconstructions in complex real-world images, though it incurs higher computational cost compared to learning-based methods.
Abstract
Existing methods for reconstructing objects and humans from a monocular image suffer from severe mesh collisions and performance limitations for interacting occluding objects. This paper introduces a method to obtain a globally consistent 3D reconstruction of interacting objects and people from a single image. Our contributions include: 1) an optimization framework, featuring a collision loss, tailored to handle human-object and human-human interactions, ensuring spatially coherent scene reconstruction; and 2) a novel technique to robustly estimate 6 degrees of freedom (DOF) poses, specifically for heavily occluded objects, exploiting image inpainting. Notably, our proposed method operates effectively on images from real-world scenarios, without necessitating scene or object-level 3D supervision. Extensive qualitative and quantitative evaluation against existing methods demonstrates a significant reduction in collisions in the final reconstructions of scenes with multiple interacting humans and objects and a more coherent scene reconstruction.
