Sketch2Prototype: Rapid Conceptual Design Exploration and Prototyping with Generative AI
Kristen M. Edwards, Brandon Man, Faez Ahmed
TL;DR
The paper addresses rapid conceptual design exploration by transforming hand-drawn sketches into text descriptions, diverse 2D images, and 3D prototypes ready for fabrication. It introduces a multi-stage pipeline—sketch-to-text via GPT-4V, text-to-image via DALL-E 3, and image-to-3D via multiple predictors—with Blender-based post-processing and additive manufacturing. Results show that employing text as an intermediate modality yields more diverse and manufacturable 3D designs than direct sketch-to-3D baselines, while also enabling user-driven iterative refinement through prompts. An open-source dataset of 1,087 milk-frother sketches with 4 generated images per sketch (4,348 images total) demonstrates alignment and diversity, underscoring the practical potential of AI-assisted design workflows and the value of human-in-the-loop feedback in early-stage prototyping.
Abstract
Sketch2Prototype is an AI-based framework that transforms a hand-drawn sketch into a diverse set of 2D images and 3D prototypes through sketch-to-text, text-to-image, and image-to-3D stages. This framework, shown across various sketches, rapidly generates text, image, and 3D modalities for enhanced early-stage design exploration. We show that using text as an intermediate modality outperforms direct sketch-to-3D baselines for generating diverse and manufacturable 3D models. We find limitations in current image-to-3D techniques, while noting the value of the text modality for user-feedback and iterative design augmentation.
