CADCrafter: Generating Computer-Aided Design Models from Unconstrained Images

Cheng Chen; Jiacheng Wei; Tianrun Chen; Chi Zhang; Xiaofeng Yang; Shangzhan Zhang; Bingchen Yang; Chuan-Sheng Foo; Guosheng Lin; Qixing Huang; Fayao Liu

CADCrafter: Generating Computer-Aided Design Models from Unconstrained Images

Cheng Chen, Jiacheng Wei, Tianrun Chen, Chi Zhang, Xiaofeng Yang, Shangzhan Zhang, Bingchen Yang, Chuan-Sheng Foo, Guosheng Lin, Qixing Huang, Fayao Liu

TL;DR

CADCrafter presents a latent diffusion framework that converts unconstrained images into editable parametric CAD command sequences by conditioning on geometry features extracted from depth and normal maps. Training solely on synthetic data, it employs a geometry-conditioned diffusion model, multi-view to single-view distillation, and a direct-preference optimization (DPO) based code checker to enforce geometric validity. The approach yields strong performance on synthetic and real-world data, outperforming baselines in command/parameter accuracy, geometric fidelity, and output validity, while enabling single-view diversity and robust real-world generalization. This work enables practical CAD generation from casual imagery with potential for scalable digital twins and manufacturing workflows by bridging the synthetic-real domain gap through geometry-aware conditioning and compiler-guided fine-tuning.

Abstract

Creating CAD digital twins from the physical world is crucial for manufacturing, design, and simulation. However, current methods typically rely on costly 3D scanning with labor-intensive post-processing. To provide a user-friendly design process, we explore the problem of reverse engineering from unconstrained real-world CAD images that can be easily captured by users of all experiences. However, the scarcity of real-world CAD data poses challenges in directly training such models. To tackle these challenges, we propose CADCrafter, an image-to-parametric CAD model generation framework that trains solely on synthetic textureless CAD data while testing on real-world images. To bridge the significant representation disparity between images and parametric CAD models, we introduce a geometry encoder to accurately capture diverse geometric features. Moreover, the texture-invariant properties of the geometric features can also facilitate the generalization to real-world scenarios. Since compiling CAD parameter sequences into explicit CAD models is a non-differentiable process, the network training inherently lacks explicit geometric supervision. To impose geometric validity constraints, we employ direct preference optimization (DPO) to fine-tune our model with the automatic code checker feedback on CAD sequence quality. Furthermore, we collected a real-world dataset, comprised of multi-view images and corresponding CAD command sequence pairs, to evaluate our method. Experimental results demonstrate that our approach can robustly handle real unconstrained CAD images, and even generalize to unseen general objects.

CADCrafter: Generating Computer-Aided Design Models from Unconstrained Images

TL;DR

Abstract

CADCrafter: Generating Computer-Aided Design Models from Unconstrained Images

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)