Generative Modelling with Tensor Train approximations of Hamilton--Jacobi--Bellman equations
David Sommer, Robert Gruhlke, Max Kirstein, Martin Eigel, Claudia Schillings
TL;DR
This work addresses the problem of sampling from a target density with unknown normalization by leveraging reverse-time diffusion and a Hamilton-Jacobi-Bellman (HJB) formulation. The authors propose a direct, time-integrated solver that operates on compressed polynomial bases encoded as Functional Tensor Trains (FTT) and Tensor Trains (TT), enabling sample-free, normalization-agnostic estimation of the log-density via a shifted HJB equation with $v_0=\Phi$. The method provides explicit TT representations for the linear and nonlinear HJB operators, along with projection and retraction steps to bound polynomial degree and TT rank, and it uses time-adaptive Euler steps (with stiffness and error criteria) or dynamical low-rank integration to solve the tensor-valued ODE. Numerical experiments in 20 dimensions demonstrate stable rank behavior, controlled errors, and accurate reverse-time sampling for Gaussian and mixed nonlinear densities, highlighting the potential for scalable Bayesian inference without requiring normalization constants. The results suggest that the TT-based HJB solver can serve as a principled, interpretable alternative to black-box neural approaches for high-dimensional generative modeling and Bayesian sampling, with avenues for future improvements via advanced DLRA techniques and diffusion-accelerated samplers.
Abstract
Sampling from probability densities is a common challenge in fields such as Uncertainty Quantification (UQ) and Generative Modelling (GM). In GM in particular, the use of reverse-time diffusion processes depending on the log-densities of Ornstein-Uhlenbeck forward processes are a popular sampling tool. In Berner et al. [2022] the authors point out that these log-densities can be obtained by solution of a \textit{Hamilton-Jacobi-Bellman} (HJB) equation known from stochastic optimal control. While this HJB equation is usually treated with indirect methods such as policy iteration and unsupervised training of black-box architectures like Neural Networks, we propose instead to solve the HJB equation by direct time integration, using compressed polynomials represented in the Tensor Train (TT) format for spatial discretization. Crucially, this method is sample-free, agnostic to normalization constants and can avoid the curse of dimensionality due to the TT compression. We provide a complete derivation of the HJB equation's action on Tensor Train polynomials and demonstrate the performance of the proposed time-step-, rank- and degree-adaptive integration method on a nonlinear sampling task in 20 dimensions.
