Mean-field Chaos Diffusion Models
Sungwoo Park, Dongjun Kim, Ahmed Alaa
TL;DR
MF-CDMs address the curse of dimensionality in score-based generative modeling by representing high-cardinality data as a large interacting-particle system and leveraging propagation of chaos to connect finite-N training to a mean-field limit. The method replaces standard score matching with a mean-field score matching objective defined on Wasserstein space, and introduces a subdivision of chaotic entropy with particle branching to enable scalable training. Theoretical analysis provides concentration and chaos-stability results for reducible score networks, and empirically MF-CDMs deliver robust, scalable generation on synthetic data and large 3D point clouds, often surpassing traditional diffusion models at high cardinalities. This framework broadens SGMs’ applicability to unstructured, high-resolution data such as large-scale point clouds and complex stochastic systems.
Abstract
In this paper, we introduce a new class of score-based generative models (SGMs) designed to handle high-cardinality data distributions by leveraging concepts from mean-field theory. We present mean-field chaos diffusion models (MF-CDMs), which address the curse of dimensionality inherent in high-cardinality data by utilizing the propagation of chaos property of interacting particles. By treating high-cardinality data as a large stochastic system of interacting particles, we develop a novel score-matching method for infinite-dimensional chaotic particle systems and propose an approximation scheme that employs a subdivision strategy for efficient training. Our theoretical and empirical results demonstrate the scalability and effectiveness of MF-CDMs for managing large high-cardinality data structures, such as 3D point clouds.
