Image-Space Collage and Packing with Differentiable Rendering
Zhenyu Wang, Min Lu
TL;DR
The paper shifts 2D collage and packing from object-space optimization to image-space optimization using differentiable rendering, enabling gradient-based fitting of diverse shapes without task-specific geometric descriptors. By representing each item with $N$ cubic Bézier curves and optimizing translations $\mathbf{t}$, rotations $\mathbf{r}$, and scales $\mathbf{s}$ to minimize a composite loss $\mathcal{L}=\alpha\mathcal{L}_{containment}+\beta\mathcal{L}_{overlap}+\gamma\mathcal{L}_{uniform}$, the method achieves robust containment, minimal overlap, and uniform distribution. A hierarchical resolution strategy—from $50\times50$ up to $600\times600$—drives fast convergence while preserving accuracy. Empirical results show order-of-magnitude speedups over baselines and demonstrate versatility across icons, text, and data-visualization collages, with ablations validating the contributions of uniform loss and multiscale optimization. The work suggests promising extensions to interactive and text-driven collage methods and potential hybridization with object-space losses for enhanced control and quality.
Abstract
Collage and packing techniques are widely used to organize geometric shapes into cohesive visual representations, facilitating the representation of visual features holistically, as seen in image collages and word clouds. Traditional methods often rely on object-space optimization, requiring intricate geometric descriptors and energy functions to handle complex shapes. In this paper, we introduce a versatile image-space collage technique. Leveraging a differentiable renderer, our method effectively optimizes the object layout with image-space losses, bringing the benefit of fixed complexity and easy accommodation of various shapes. Applying a hierarchical resolution strategy in image space, our method efficiently optimizes the collage with fast convergence, large coarse steps first and then small precise steps. The diverse visual expressiveness of our approach is demonstrated through various examples. Experimental results show that our method achieves an order-of-magnitude speedup compared to state-of-the-art techniques. The project page is https://szuviz.github.io/pixel-space-collage-technique/.
