Flow Score Distillation for Diverse Text-to-3D Generation
Runjie Yan, Kailu Wu, Kaisheng Ma
TL;DR
Flow Score Distillation (FSD) tackles the diversity limitation of Score Distillation Sampling (SDS) in text-to-3D generation by linking SDS to the DDIM PF-ODE formulation and replacing stochastic noise with a deterministic, view-coherent noise strategy via a world-map noise function $\boldsymbol{\epsilon}(\boldsymbol{c})$. The method reframes the PF-ODE as an SDS-like loss and uses a monotone timestep schedule to align with DDIM, which improves generation diversity without sacrificing quality. FSD is lifted to 3D by designing a view-dependent noise and a stable 3D rendering loss $L_{FSD}$, with a world-map noise design that avoids geometry holes seen with naive noise. Experiments on Stable Diffusion and MVDream backbones show substantial diversity gains and robust quality, demonstrating that flow-based diffusion priors can be effectively applied to text-to-3D generation.
Abstract
Recent advancements in Text-to-3D generation have yielded remarkable progress, particularly through methods that rely on Score Distillation Sampling (SDS). While SDS exhibits the capability to create impressive 3D assets, it is hindered by its inherent maximum-likelihood-seeking essence, resulting in limited diversity in generation outcomes. In this paper, we discover that the Denoise Diffusion Implicit Models (DDIM) generation process (\ie PF-ODE) can be succinctly expressed using an analogue of SDS loss. One step further, one can see SDS as a generalized DDIM generation process. Following this insight, we show that the noise sampling strategy in the noise addition stage significantly restricts the diversity of generation results. To address this limitation, we present an innovative noise sampling approach and introduce a novel text-to-3D method called Flow Score Distillation (FSD). Our validation experiments across various text-to-image Diffusion Models demonstrate that FSD substantially enhances generation diversity without compromising quality.
