Seek-CAD: A Self-refined Generative Modeling for 3D Parametric CAD Using Local Inference via DeepSeek

Xueyang Li; Jiahao Li; Yu Song; Yunzhong Lou; Xiangdong Zhou

Seek-CAD: A Self-refined Generative Modeling for 3D Parametric CAD Using Local Inference via DeepSeek

Xueyang Li, Jiahao Li, Yu Song, Yunzhong Lou, Xiangdong Zhou

TL;DR

Seek-CAD tackles the problem of generating high-fidelity 3D parametric CAD models without fine-tuning large LLMs by leveraging a locally deployable open-source reasoning model (DeepSeek-R1) in a training-free framework. It introduces a retrieval-augmented generation pipeline and a novel SSR (Sketch, Sketch-based feature, Refinements) design paradigm, augmented with step-wise visual feedback and Chain-of-Thought alignment via Gemini-2.0 to iteratively refine CAD code. A CapType-based reference mechanism enables precise refinement of complex geometry, and a 40k-sample SSR CAD dataset supports practical industrial modeling needs. Experimental results show Seek-CAD achieves high geometric fidelity (CD/HD), accurate target descriptions (IoGT, G-Score), and meaningful diversity, highlighting the practicality of open-source, cost-efficient AI-assisted design workflows.

Abstract

The advent of Computer-Aided Design (CAD) generative modeling will significantly transform the design of industrial products. The recent research endeavor has extended into the realm of Large Language Models (LLMs). In contrast to fine-tuning methods, training-free approaches typically utilize the advanced closed-source LLMs, thereby offering enhanced flexibility and efficiency in the development of AI agents for generating CAD parametric models. However, the substantial cost and limitations of local deployment of the top-tier closed-source LLMs pose challenges in practical applications. The Seek-CAD is the pioneer exploration of locally deployed open-source inference LLM DeepSeek-R1 for CAD parametric model generation with a training-free methodology. This study is the first investigation to incorporate both visual and Chain-of-Thought (CoT) feedback within the self-refinement mechanism for generating CAD models. Specifically, the initial generated parametric CAD model is rendered into a sequence of step-wise perspective images, which are subsequently processed by a Vision Language Model (VLM) alongside the corresponding CoTs derived from DeepSeek-R1 to assess the CAD model generation. Then, the feedback is utilized by DeepSeek-R1 to refine the initial generated model for the next round of generation. Moreover, we present an innovative 3D CAD model dataset structured around the SSR (Sketch, Sketch-based feature, and Refinements) triple design paradigm. This dataset encompasses a wide range of CAD commands, thereby aligning effectively with industrial application requirements and proving suitable for the generation of LLMs. Extensive experiments validate the effectiveness of Seek-CAD under various metrics.

Seek-CAD: A Self-refined Generative Modeling for 3D Parametric CAD Using Local Inference via DeepSeek

TL;DR

Abstract

Seek-CAD: A Self-refined Generative Modeling for 3D Parametric CAD Using Local Inference via DeepSeek

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (14)