B-repLer: Language-guided Editing of CAD Models
Yilin Liu, Niladri Shekhar Dutt, Changjian Li, Niloy J. Mitra
TL;DR
The paper tackles the challenge of language-guided CAD editing by proposing B-repLer, which edits directly in the B-rep latent space without construction history. It introduces BrepEDIT-240K, a large-scale dataset built with a multimodal generation pipeline and CAD tooling to produce paired pre-/post-edit B-rep models with rich text annotations. The core method combines a variational autoregressive transformer with a flow matching network to produce diverse, valid edits that respect B-rep topology and complex freeform surfaces. Experimental results show improvements over state-of-the-art baselines in edit realism, prompt alignment, and the ability to handle high-level semantic instructions, enabling a scalable, native text-guided editing workflow for CAD models.
Abstract
Computer-Aided Design (CAD) models, given their compactness and precision, remain the industry standard for designing and fabricating engineering objects. However, language-guided CAD editing is still in its infancy, largely due to missing semantic connection between user commands and underlying shape geometry, a problem exacerbated by the shortage of paired text-and-edit CAD datasets. While recent Multimodal Large Language Models (mLLMs) have attempted to bridge this gap, their reliance on CAD construction history -- often an expensive and hard to obtain input -- severely limits their expressiveness and restricts their usage. We present B-repLer, a novel framework that directly connects natural language with editing CAD models by operating in a learned latent space. Importantly, our approach bypasses the need for construction history, enabling semantic edits on a wide range of geometries, from simple prismatic parts to complex freeform shapes defined by B-Spline surfaces. To facilitate this research, we introduce BrepEDIT-240K, the first large-scale dataset for this task. We demonstrate how this paired dataset can be automatically generated, (user) validated, and scaled by leveraging existing CAD tools, in conjunction with mLLMs, to create the required paired data without relying on any external annotations. Our results demonstrate that B-repLer can accurately perform complex edits on complex CAD shapes, even when the input edit specifications are high-level and ambiguous to interpret, consistently producing valid, high-quality CAD outputs enabling a class of text-guided edits not previously possible.
