Doodle Your 3D: From Abstract Freehand Sketches to Precise 3D Shapes
Hmrishav Bandyopadhyay, Subhadeep Koley, Ayan Das, Ayan Kumar Bhunia, Aneeshan Sain, Pinaki Nath Chowdhury, Tao Xiang, Yi-Zhe Song
TL;DR
This paper addresses generating precise 3D shapes from abstract freehand sketches without requiring paired sketch-3D datasets. It introduces a part-aware implicit representation and a latent diffusion model operating in a shared INR latent space, with cross-modal correspondence established by unsupervised part discovery and sketch-to-shape alignment via CLIPasso-derived edgemaps. The same part-level decoder supports both sketch modelling and 3D generation, and enables in-position editing by local part manipulation. The approach achieves efficient, high-fidelity 3D generation from highly abstract inputs, outperforming state-of-the-art sketch-conditioned methods on conditional metrics while generalizing to hand-drawn sketches and multi-view prompts. This paves the way for accessible, editable 3D content creation with reduced data requirements and computation.
Abstract
In this paper, we democratise 3D content creation, enabling precise generation of 3D shapes from abstract sketches while overcoming limitations tied to drawing skills. We introduce a novel part-level modelling and alignment framework that facilitates abstraction modelling and cross-modal correspondence. Leveraging the same part-level decoder, our approach seamlessly extends to sketch modelling by establishing correspondence between CLIPasso edgemaps and projected 3D part regions, eliminating the need for a dataset pairing human sketches and 3D shapes. Additionally, our method introduces a seamless in-position editing process as a byproduct of cross-modal part-aligned modelling. Operating in a low-dimensional implicit space, our approach significantly reduces computational demands and processing time.
