ViewCraft3D: High-Fidelity and View-Consistent 3D Vector Graphics Synthesis
Chuang Wang, Haitao Zhou, Ling Luo, Qian Yu
TL;DR
ViewCraft3D (VC3D) generates high-fidelity, view-consistent 3D vector graphics from a single image. It leverages 3D priors in a two-stage optimization: Stage I fits cubic Bézier curves to the salient 3D structure extracted from a mesh reconstructed by an image-to-3D model, and Stage II recovers missing details with Score Distillation Sampling guided by a pretrained model. The 3D curves are rendered to 2D views via differentiable projection, so the curves can be optimized directly from rendered views. Compared with prior 3D vector graphics approaches, the method shows better view consistency and aesthetics while reducing generation time. By relying on 3D priors rather than heavy 2D priors, it avoids cross-view inconsistencies and makes expressive 3D vector graphics more accessible for VR, shape retrieval, and conceptual design.
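To make the curve representation concrete, the following is a minimal sketch of evaluating a 3D cubic Bézier curve and projecting its samples to a 2D view. It is not the authors' implementation: the function names, the pinhole camera placed on the z-axis, and the parameters `focal` and `cam_dist` are all assumptions chosen for illustration. Every operation is a smooth arithmetic expression, which is what would let an autodiff framework propagate gradients from the 2D view back to the 3D control points.

```python
import math

def cubic_bezier_3d(p0, p1, p2, p3, t):
    """Evaluate a cubic Bezier curve at t in [0, 1] using the Bernstein form."""
    s = 1.0 - t
    b0, b1, b2, b3 = s ** 3, 3 * s * s * t, 3 * s * t * t, t ** 3
    return tuple(b0 * a + b1 * b + b2 * c + b3 * d
                 for a, b, c, d in zip(p0, p1, p2, p3))

def rotate_y(point, angle):
    """Rotate a 3D point about the y-axis to emulate a change of viewpoint."""
    x, y, z = point
    c, s = math.cos(angle), math.sin(angle)
    return (c * x + s * z, y, -s * x + c * z)

def project_pinhole(point, focal=1.0, cam_dist=3.0):
    """Perspective projection for a hypothetical camera at z = cam_dist
    looking toward the origin (assumed setup, not taken from the paper)."""
    x, y, z = point
    depth = cam_dist - z  # distance from the camera along the viewing axis
    return (focal * x / depth, focal * y / depth)

def render_curve(ctrl, angle, n=16):
    """Sample n points on the curve and project them into the rotated view."""
    samples = [cubic_bezier_3d(*ctrl, i / (n - 1)) for i in range(n)]
    return [project_pinhole(rotate_y(p, angle)) for p in samples]

# Example: one curve with four hypothetical control points, seen from two views.
ctrl = ((0.0, 0.0, 0.0), (0.0, 1.0, 0.0), (1.0, 1.0, 0.0), (1.0, 0.0, 0.0))
front_view = render_curve(ctrl, 0.0)
side_view = render_curve(ctrl, math.pi / 4)
```

Because both views are produced from the same 3D control points, they are consistent by construction; a 2D-only pipeline would have to enforce that agreement separately.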
Abstract
3D vector graphics play a crucial role in various applications, including 3D shape retrieval, conceptual design, and virtual reality interactions, due to their ability to capture essential structural information with minimal representation. While recent approaches have shown promise in generating 3D vector graphics, they often suffer from lengthy processing times and struggle to maintain view consistency. To address these limitations, we propose ViewCraft3D (VC3D), an efficient method that leverages 3D priors to generate 3D vector graphics. Specifically, our approach begins with 3D object analysis, employs a geometric extraction algorithm to fit 3D vector graphics to the underlying structure, and applies a view-consistent refinement process to enhance visual quality. Our comprehensive experiments demonstrate that VC3D outperforms previous methods in both qualitative and quantitative evaluations, while significantly reducing computational overhead. The resulting 3D sketches maintain view consistency and effectively capture the essential characteristics of the original objects.
