SVG360: Multi-View SVG Generation with Geometric and Color Consistency from a Single SVG
Mengnan Jiang, Zhaolin Sun, Christian Franke, Michele Franco Adesso, Antonio Haas, Grace Li Zhang
TL;DR
SVG360 tackles the problem of producing multi-view, fully editable SVGs from a single input by integrating a three-stage pipeline: 3D-aware multi-view raster generation with appearance harmonization, spatially aligned segmentation propagation across a viewing-sphere, and vector-domain consolidation to yield compact, coherent vector paths. The method leverages Trellis for 3D-based raster synthesis, Gaussian splatting for efficient multi-view rendering, and a Spatial-SAM2 module that propagates part-level segmentation using a sphere-based proximity metric $d(\theta_i,\theta_j)=\mathrm{atan2}(\|u_i \times u_j\|, \mathrm{clip}(u_i\cdot u_j,-1,1))$. It then converts raster segments to vector paths via VTracer, followed by color and topology consolidation including color mapping against a reference palette using $CIEDE2000$ distance to ensure cross-view consistency. Quantitative results show reduced path counts and color drift across views compared to Turntable, demonstrating improved geometric stability and editability for design workflows and enabling more scalable asset creation and semantic vector editing.
Abstract
Scalable Vector Graphics (SVGs) are central to modern design workflows, offering scaling without distortion and precise editability. However, for single object SVGs, generating multi-view consistent SVGs from a single-view input remains underexplored. We present a three stage framework that produces multi-view SVGs with geometric and color consistency from a single SVG input. First, the rasterized input is lifted to a 3D representation and rendered under target camera poses, producing multi-view images of the object. Next, we extend the temporal memory mechanism of Segment Anything 2 (SAM2) to the spatial domain, constructing a spatial memory bank that establishes part level correspondences across neighboring views, yielding cleaner and more consistent vector paths and color assignments without retraining. Finally, during the raster to vector conversion, we perform path consolidation and structural optimization to reduce redundancy while preserving boundaries and semantics. The resulting SVGs exhibit strong geometric and color consistency across views, significantly reduce redundant paths, and retain fine structural details. This work bridges generative modeling and structured vector representation, providing a scalable route to single input, object level multi-view SVG generation and supporting applications such as asset creation and semantic vector editing.
