DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models
Ximing Xing, Chuang Wang, Haitao Zhou, Jing Zhang, Qian Yu, Dong Xu
TL;DR
DiffSketcher tackles the problem of generating free-hand vector sketches from text by leveraging a pretrained latent diffusion prior and a differentiable rasterizer to optimize Bézier strokes. The approach extends Score Distillation Sampling (SDS) to guide vector-graphic parameters, and introduces an attention-guided stroke initialization plus an opacity-aware, augmentation-based loss to capture brush-like quality while preserving semantic fidelity. Key contributions include (1) a text-to-sketch diffusion framework for object- and scene-level vector sketches, (2) three quality-enhancing strategies (extended SDS, attention-based initialization, and stroke opacity), and (3) comprehensive experiments showing improved semantic alignment, aesthetics, and recognizability over prior methods. The method enables efficient, scalable text-driven vector sketching with potential applications in design and education, and highlights future directions for better abstractness control and style transfer.
Abstract
Even though trained mainly on images, we discover that pretrained diffusion models show impressive power in guiding sketch synthesis. In this paper, we present DiffSketcher, an innovative algorithm that creates \textit{vectorized} free-hand sketches using natural language input. DiffSketcher is developed based on a pre-trained text-to-image diffusion model. It performs the task by directly optimizing a set of Bézier curves with an extended version of the score distillation sampling (SDS) loss, which allows us to use a raster-level diffusion model as a prior for optimizing a parametric vectorized sketch generator. Furthermore, we explore attention maps embedded in the diffusion model for effective stroke initialization to speed up the generation process. The generated sketches demonstrate multiple levels of abstraction while maintaining recognizability, underlying structure, and essential visual details of the subject drawn. Our experiments show that DiffSketcher achieves greater quality than prior work. The code and demo of DiffSketcher can be found at https://ximinng.github.io/DiffSketcher-project/.
