Fast3Dcache: Training-free 3D Geometry Synthesis Acceleration
Mengyu Yang, Yanming Yang, Chenyi Xu, Chenxi Song, Yufan Zuo, Tong Zhao, Ruibo Li, Chi Zhang
TL;DR
Fast3Dcache introduces a training-free, geometry-aware caching framework for 3D diffusion. It leverages two core components, PCSC and SSC, to predict cache budgets and select stable tokens, guided by observed voxel stabilization and latent dynamics. The approach achieves substantial inference speedups (up to 27.12%) and FLOPs reductions (54.8%) while preserving geometry with minimal distortion in Chamfer Distance and F-Score. Experiments on TRELLIS/DSO show favorable acceleration-accuracy trade-offs and compatibility with modality-agnostic accelerators, indicating practical impact for efficient 3D geometry synthesis. The work highlights the importance of 3D-specific stability cues when reusing cached computations in diffusion-based 3D generation.
Abstract
Diffusion models have achieved impressive generative quality across modalities like 2D images, videos, and 3D shapes, but their inference remains computationally expensive due to the iterative denoising process. While recent caching-based methods effectively reuse redundant computations to speed up 2D and video generation, directly applying these techniques to 3D diffusion models can severely disrupt geometric consistency. In 3D synthesis, even minor numerical errors in cached latent features accumulate, causing structural artifacts and topological inconsistencies. To overcome this limitation, we propose Fast3Dcache, a training-free geometry-aware caching framework that accelerates 3D diffusion inference while preserving geometric fidelity. Our method introduces a Predictive Caching Scheduler Constraint (PCSC) to dynamically determine cache quotas according to voxel stabilization patterns and a Spatiotemporal Stability Criterion (SSC) to select stable features for reuse based on velocity magnitude and acceleration criterion. Comprehensive experiments show that Fast3Dcache accelerates inference significantly, achieving up to a 27.12% speed-up and a 54.8% reduction in FLOPs, with minimal degradation in geometric quality as measured by Chamfer Distance (2.48%) and F-Score (1.95%).
