MamTiff-CAD: Multi-Scale Latent Diffusion with Mamba+ for Complex Parametric Sequence

Liyuan Deng; Yunpeng Bai; Yongkang Dai; Xiaoshui Huang; Hongping Gan; Dongshuo Huang; Hao jiacheng; Yilei Shi

MamTiff-CAD: Multi-Scale Latent Diffusion with Mamba+ for Complex Parametric Sequence

Liyuan Deng, Yunpeng Bai, Yongkang Dai, Xiaoshui Huang, Hongping Gan, Dongshuo Huang, Hao jiacheng, Yilei Shi

TL;DR

MamTiff-CAD tackles the challenge of generating long, complex parametric CAD command sequences by fusing a Forget Gate-enhanced Mamba+ encoder with a Transformer-based decoder and a latent Multi-Scale Transformer diffusion generator. The two-stage framework maps CAD sequences into a compact latent space, then learns their distribution with diffusion in that space, enabling robust reconstruction and high-quality unconditional generation for sequences up to $N_c=256$ commands. A new ABC-256 dataset with $13{,}705$ long sequences (60–256 commands) supports evaluation and demonstrates superior autoencoding and generation performance compared to prior CAD models, while generalizing to Fusion 360 data. These results advance industrial CAD design by enabling scalable, coherent generation of long parametric sequences and opening avenues for integrating Brep/CSG representations and interactive design workflows.

Abstract

Parametric Computer-Aided Design (CAD) is crucial in industrial applications, yet existing approaches often struggle to generate long sequence parametric commands due to complex CAD models' geometric and topological constraints. To address this challenge, we propose MamTiff-CAD, a novel CAD parametric command sequences generation framework that leverages a Transformer-based diffusion model for multi-scale latent representations. Specifically, we design a novel autoencoder that integrates Mamba+ and Transformer, to transfer parameterized CAD sequences into latent representations. The Mamba+ block incorporates a forget gate mechanism to effectively capture long-range dependencies. The non-autoregressive Transformer decoder reconstructs the latent representations. A diffusion model based on multi-scale Transformer is then trained on these latent embeddings to learn the distribution of long sequence commands. In addition, we also construct a dataset that consists of long parametric sequences, which is up to 256 commands for a single CAD model. Experiments demonstrate that MamTiff-CAD achieves state-of-the-art performance on both reconstruction and generation tasks, confirming its effectiveness for long sequence (60-256) CAD model generation.

MamTiff-CAD: Multi-Scale Latent Diffusion with Mamba+ for Complex Parametric Sequence

TL;DR

Abstract

MamTiff-CAD: Multi-Scale Latent Diffusion with Mamba+ for Complex Parametric Sequence

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (11)