Inverse design with conditional cascaded diffusion models
Milad Habibi, Mark Fuge
TL;DR
This work introduces a conditional cascaded diffusion (cCDM) framework for multi-resolution inverse design in topology optimization and benchmarks it against a transfer-learning cGAN. By cascading a low-resolution diffusion stage with a high-resolution super-resolution diffusion stage, conditioned on boundary data and physics channels, the approach achieves superior pixel-level fidelity and compliance performance when high-resolution data are plentiful. The study quantifies how performance scales with high-resolution data size, identifying a break-even around 102 samples where cGANs outperform cCDM under data scarcity, while also showing that prioritizing a strong low-resolution predictor markedly improves downstream high-resolution results. Overall, cCDM demonstrates strong potential for efficient, high-fidelity multi-resolution design in engineering, with clear guidance on data requirements and future directions to reduce inference time and extend to 3D problems.
Abstract
Adjoint-based design optimizations are usually computationally expensive and those costs scale with resolution. To address this, researchers have proposed machine learning approaches for inverse design that can predict higher-resolution solutions from lower cost/resolution ones. Due to the recent success of diffusion models over traditional generative models, we extend the use of diffusion models for multi-resolution tasks by proposing the conditional cascaded diffusion model (cCDM). Compared to GANs, cCDM is more stable to train, and each diffusion model within the cCDM can be trained independently, thus each model's parameters can be tuned separately to maximize the performance of the pipeline. Our study compares cCDM against a cGAN model with transfer learning. Our results demonstrate that the cCDM excels in capturing finer details, preserving volume fraction constraints, and minimizing compliance errors in multi-resolution tasks when a sufficient amount of high-resolution training data (more than 102 designs) is available. Furthermore, we explore the impact of training data size on the performance of both models. While both models show decreased performance with reduced high-resolution training data, the cCDM loses its superiority to the cGAN model with transfer learning when training data is limited (less than 102), and we show the break-even point for this transition. Also, we highlight that while the diffusion model may achieve better pixel-wise performance in both low-resolution and high-resolution scenarios, this does not necessarily guarantee that the model produces optimal compliance error or constraint satisfaction.
