3D Reconstruction from Sketches
Abhimanyu Talwar, Julien Laasri
TL;DR
This work tackles 3D reconstruction from sketches by integrating a stitching-based front end with a sketch-to-image translation (CycleGAN) and a depth estimator (MegaDepth) to produce textured 3D surfaces. A dataset of image-sketch pairs derived from the Zurich Building Database enables training of the CycleGAN to convert sketches into realistic images suitable for depth inference. The results show that stitching across real, style-variant drawings is brittle, but the single-sketch pathway yields plausible 3D reconstructions and depth maps for a variety of drawings. The approach offers a path to modeling pre-photography scenes and can be extended with more artist diversity and domain-specific fine-tuning of depth estimators to improve robustness and accuracy.
Abstract
We consider the problem of reconstructing a 3D scene from multiple sketches. We propose a pipeline which involves (1) stitching together multiple sketches through use of correspondence points, (2) converting the stitched sketch into a realistic image using a CycleGAN, and (3) estimating that image's depth-map using a pre-trained convolutional neural network based architecture called MegaDepth. Our contribution includes constructing a dataset of image-sketch pairs, the images for which are from the Zurich Building Database, and sketches have been generated by us. We use this dataset to train a CycleGAN for our pipeline's second step. We end up with a stitching process that does not generalize well to real drawings, but the rest of the pipeline that creates a 3D reconstruction from a single sketch performs quite well on a wide variety of drawings.
