RLCAD: Reinforcement Learning Training Gym for Revolution Involved CAD Command Sequence Generation

Xiaolong Yin; Xingyu Lu; Jiahang Shen; Jingzhe Ni; Hailong Li; Ruofeng Tong; Min Tang; Peng Du

RLCAD: Reinforcement Learning Training Gym for Revolution Involved CAD Command Sequence Generation

Xiaolong Yin, Xingyu Lu, Jiahang Shen, Jingzhe Ni, Hailong Li, Ruofeng Tong, Min Tang, Peng Du

TL;DR

This work presents RLCAD, a geometry-engine-backed RL training gym for generating CAD command sequences from boundary representations, extending support from 2D sketches to revolution operations. It integrates a UV-Net–based B-Rep encoder, a cross-modal policy using GTrXL, and a PPO-based learning loop with a hybrid reward that combines global and local geometric cues. A 20k-model ABC-derived dataset and curriculum-driven pretraining enable robust generalization to complex CAD tasks, achieving state-of-the-art reconstruction quality compared to baselines like Fusion 360 Gallery and CAD-Recode. The framework demonstrates substantial improvements in efficiency and fidelity, with broader implications for automated parametric CAD generation and robotics-inspired CAD workflows.

Abstract

A CAD command sequence is a typical parametric design paradigm in 3D CAD systems where a model is constructed by overlaying 2D sketches with operations such as extrusion, revolution, and Boolean operations. Although there is growing academic interest in the automatic generation of command sequences, existing methods and datasets only support operations such as 2D sketching, extrusion,and Boolean operations. This limitation makes it challenging to represent more complex geometries. In this paper, we present a reinforcement learning (RL) training environment (gym) built on a CAD geometric engine. Given an input boundary representation (B-Rep) geometry, the policy network in the RL algorithm generates an action. This action, along with previously generated actions, is processed within the gym to produce the corresponding CAD geometry, which is then fed back into the policy network. The rewards, determined by the difference between the generated and target geometries within the gym, are used to update the RL network. Our method supports operations beyond sketches, Boolean, and extrusion, including revolution operations. With this training gym, we achieve state-of-the-art (SOTA) quality in generating command sequences from B-Rep geometries.

RLCAD: Reinforcement Learning Training Gym for Revolution Involved CAD Command Sequence Generation

TL;DR

Abstract

RLCAD: Reinforcement Learning Training Gym for Revolution Involved CAD Command Sequence Generation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)