An Empty Room is All We Want: Automatic Defurnishing of Indoor Panoramas
Mira Slavcheva, Dave Gausebeck, Kevin Chen, David Buchhofer, Azwad Sabik, Chen Ma, Sachal Dhillon, Olaf Brandt, Alan Dolhasz
TL;DR
This work tackles automatic defurnishing of indoor panoramas by creating a domain-tuned inpainting pipeline built on Stable Diffusion. It foregrounds context-rich equirectangular panoramas, localizes furniture via semantic segmentation, and uses a robust inpainting model trained with synthetic shadows and a diverse set of prompts to mitigate hallucinations without relying on room layout estimation. A targeted pre-processing and a post-processing blend preserve high-frequency detail, resulting in crisper textures and lower perceptual distortion than competing methods, as demonstrated by quantitative metrics and qualitative examples. The approach advances practical digital twin workflows by enabling consistent, high-fidelity defurnishing suitable for real estate visualization, interior design exploration, and renovation planning.
Abstract
We propose a pipeline that leverages Stable Diffusion to improve inpainting results in the context of defurnishing -- the removal of furniture items from indoor panorama images. Specifically, we illustrate how increased context, domain-specific model fine-tuning, and improved image blending can produce high-fidelity inpaints that are geometrically plausible without needing to rely on room layout estimation. We demonstrate qualitative and quantitative improvements over other furniture removal techniques.
