Image2CADSeq: Computer-Aided Design Sequence and Knowledge Inference from Product Images
Xingang Li, Zhenghui Sha
TL;DR
This paper introduces Image2CADSeq, a data-driven approach to predict sequences of CAD operations from a single image using a target-embedding variational autoencoder (TEVAE). It leverages a Fusion 360 Gallery domain-specific language to define and vectorize CAD programs, and a two-stage training pipeline (Stage 1: CAD-sequence latent learning; Stage 2: image-to-latent regression) aided by a synthetic data generation pipeline. A comprehensive, multi-level evaluation framework assesses CAD programs, 3D models, and images, showing that incorporating design rules and TEVAE yields the best performance, with notable improvements over baseline AE and real-world parsing challenges. The work demonstrates potential for accelerating CAD reconstruction, enabling design knowledge capture and democratization, while also outlining routes to handle more complex geometries and real-world data.
Abstract
Computer-aided design (CAD) tools empower designers to design and modify 3D models through a series of CAD operations, commonly referred to as a CAD sequence. In scenarios where digital CAD files are not accessible, reverse engineering (RE) has been used to reconstruct 3D CAD models. Recent advances have seen the rise of data-driven approaches for RE, with a primary focus on converting 3D data, such as point clouds, into 3D models in boundary representation (B-rep) format. However, obtaining 3D data poses significant challenges, and B-rep models do not reveal knowledge about the 3D modeling process of designs. To this end, our research introduces a novel data-driven approach with an Image2CADSeq neural network model. This model aims to reverse engineer CAD models by processing images as input and generating CAD sequences. These sequences can then be translated into B-rep models using a solid modeling kernel. Unlike B-rep models, CAD sequences offer enhanced flexibility to modify individual steps of model creation, providing a deeper understanding of the construction process of CAD models. To quantitatively and rigorously evaluate the predictive performance of the Image2CADSeq model, we have developed a multi-level evaluation framework for model assessment. The model was trained on a specially synthesized dataset, and various network architectures were explored to optimize the performance. The experimental and validation results show great potential for the model in generating CAD sequences from 2D image data.
