Object Placement for Anything
Bingjie Gao, Bo Zhang, Li Niu
TL;DR
This work tackles open-set object placement by addressing the limited labeled data problem through a semi-supervised framework that jointly leverages small labeled datasets and large unlabeled image collections. It introduces similarity variation transfer, enabling knowledge about how placement changes affect rationality to propagate from labeled to unlabeled data via a shared similarity classifier and adversarial domain alignment. The approach is complemented by constructing the Open Object Placement Assessment (OOPA) dataset from Open Images and extensive experiments showing improved generalization, plausibility, and diversity over state-of-the-art methods. Overall, the method demonstrates that incorporating unlabeled data with targeted loss components yields more robust and transferable object placement models for real-world scenes.
Abstract
Object placement aims to determine the appropriate placement (\emph{e.g.}, location and size) of a foreground object when placing it on the background image. Most previous works are limited by small-scale labeled dataset, which hinders the real-world application of object placement. In this work, we devise a semi-supervised framework which can exploit large-scale unlabeled dataset to promote the generalization ability of discriminative object placement models. The discriminative models predict the rationality label for each foreground placement given a foreground-background pair. To better leverage the labeled data, under the semi-supervised framework, we further propose to transfer the knowledge of rationality variation, \emph{i.e.}, whether the change of foreground placement would result in the change of rationality label, from labeled data to unlabeled data. Extensive experiments demonstrate that our framework can effectively enhance the generalization ability of discriminative object placement models.
