PartStickers: Generating Parts of Objects for Rapid Prototyping
Mo Zhou, Josh Myers-Dean, Danna Gurari
TL;DR
PartStickers tackles the challenge of generating isolated object parts on neutral backgrounds from text prompts using a diffusion-based approach. It trains a model via LoRA fine-tuning on a dataset of part stickers created from part segmentation masks, employing a two-stage pipeline that pastes masked parts onto a gray canvas and uses prompts of the form $\text{a [PART] of a [OBJECT]}$. The diffusion process operates over $T$ timesteps with a variance schedule $\{\beta_t\}_{t=1}^T$ and optimizes the denoising loss $L_{denoise}$ to produce realistic, part-focused outputs that align with text prompts. Empirically, PartStickers achieves superior realism and text-part alignment compared with baselines on PartImageNet-derived data, demonstrating reliable part isolation, neutral backgrounds, and center-focused generation suitable for rapid prototyping, with implications for streamlined design workflows and remix-driven creativity.
Abstract
Design prototyping involves creating mockups of products or concepts to gather feedback and iterate on ideas. While prototyping often requires specific parts of objects, such as when constructing a novel creature for a video game, existing text-to-image methods tend to only generate entire objects. To address this, we propose a novel task and method of ``part sticker generation", which entails generating an isolated part of an object on a neutral background. Experiments demonstrate our method outperforms state-of-the-art baselines with respect to realism and text alignment, while preserving object-level generation capabilities. We publicly share our code and models to encourage community-wide progress on this new task: https://partsticker.github.io.
