Product Jacobi-Theta Boltzmann machines with score matching
Andrea Pasquale, Daniel Krefl, Stefano Carrazza, Frank Nielsen
TL;DR
The paper tackles high-dimensional density estimation by contrasting the Riemann-Theta Boltzmann machine (RTBM) with a restricted variant, the product Jacobi-Theta Boltzmann machine (pJTBM). It adopts score matching via the Fisher divergence to train these models without needing the partition function, and leverages a diagonal hidden-block Q to enable Jacobi-Theta factorization, yielding linear-scaling derivatives for the Fisher cost. Empirical results demonstrate that the pJTBM trains orders of magnitude faster than the full RTBM and can accommodate larger hidden layers while delivering comparable goodness-of-fit, albeit with some variance differences. The work highlights a scalable approach to exact-density representation using theta-function factorization and suggests promising directions for higher-dimensional density modelling with alternative optimization schemes.
Abstract
The estimation of probability density functions is a non trivial task that over the last years has been tackled with machine learning techniques. Successful applications can be obtained using models inspired by the Boltzmann machine (BM) architecture. In this manuscript, the product Jacobi-Theta Boltzmann machine (pJTBM) is introduced as a restricted version of the Riemann-Theta Boltzmann machine (RTBM) with diagonal hidden sector connection matrix. We show that score matching, based on the Fisher divergence, can be used to fit probability densities with the pJTBM more efficiently than with the original RTBM.
