Sparse additive function decompositions facing basis transforms
Fatima Antarou Ba, Oleh Melnyk, Christian Wald, Gabriele Steidl
TL;DR
This work addresses the challenge of sparse additive decompositions in high-dimensional functions by learning an orthogonal basis that reveals sparsity in the ANOVA/anchored decompositions. It introduces a three-step transform strategy that couples vertex reduction via gradient SVD, edge reduction via joint block diagonalization of Hessians, and blockwise sparsification through a relaxed $0$-norm loss optimized over $SO(d)$ with gradient- or Landing-based methods. The approach is supported by theoretical links between derivatives, function graphs, and decompositions, together with convergence analysis for the manifold optimization and extensive numerical experiments demonstrating recovery of two-variable summands after transformation. The results have potential impact on efficient high-dimensional integration and sparse graphical modeling by enabling sparse representations where none were apparent in the original basis.
Abstract
High-dimensional real-world systems can often be well characterized by a small number of simultaneous low-complexity interactions. The analysis of variance (ANOVA) decomposition and the anchored decomposition are typical techniques to find sparse additive decompositions of functions. In this paper, we are interested in a setting, where these decompositions are not directly spare, but become so after an appropriate basis transform. Noting that the sparsity of those additive function decompositions is equivalent to the fact that most of its mixed partial derivatives vanish, we can exploit a connection to the underlying function graphs to determine an orthogonal transform that realizes the appropriate basis change. This is done in three steps: we apply singular value decomposition to minimize the number of vertices of the function graph, and joint block diagonalization techniques of families of matrices followed by sparse minimization based on relaxations of the zero ''norm'' for minimizing the number of edges. For the latter one, we propose and analyze minimization techniques over the manifold of special orthogonal matrices. Various numerical examples illustrate the reliability of our approach for functions having, after a basis transform, a sparse additive decomposition into summands with at most two variables.
