Approximation Theory of Tree Tensor Networks: Tensorized Univariate Functions -- Part I
Mazen Ali, Anthony Nouy
TL;DR
This work introduces a tensorization-based framework to study one-dimensional function approximation on $[0,1)$ using tree tensor networks (TNs) and their rank-structured representations. It identifies $L^p$ spaces with tensor-product spaces via the map $T_{b,d}$, defines tensor subspaces $ extbf{V}_{b,d,S}$ and the TT format, and constructs three approximation tools with measures of complexity—$ ext{compl}_{ ext{N}}$, $ ext{compl}_{ ext{C}}$, and $ ext{compl}_{ ext{S}}$—leading to the approximation classes $N_q^ ext{α}$, $C_q^ ext{α}$, and $S_q^ ext{α}$. The main result is that these classes are quasi-normed spaces with continuous embeddings $C_q^ ext{α} o S_q^ ext{α} o N_q^ ext{α} o C_q^{ ext{α}/2}$, and Part II will connect these TN spaces to Besov smoothness, including the (non-)embedding results. The work also frames tensorization as a feature step implementable in neural networks, drawing parallels to ReLU networks and highlighting the expressive power of depth and sparsity in TN architectures. Altogether, the paper typifies a bridge between TN-based approximation theory and neural-network expressivity, with Part II clarifying Besov embeddings and inverse-embedding phenomena.
Abstract
We study the approximation of functions by tensor networks (TNs). We show that Lebesgue $L^p$-spaces in one dimension can be identified with tensor product spaces of arbitrary order through tensorization. We use this tensor product structure to define subsets of $L^p$ of rank-structured functions of finite representation complexity. These subsets are then used to define different approximation classes of tensor networks, associated with different measures of complexity. These approximation classes are shown to be quasi-normed linear spaces. We study some elementary properties and relationships of said spaces. In part II of this work, we will show that classical smoothness (Besov) spaces are continuously embedded into these approximation classes. We will also show that functions in these approximation classes do not possess any Besov smoothness, unless one restricts the depth of the tensor networks. The results of this work are both an analysis of the approximation spaces of TNs and a study of the expressivity of a particular type of neural networks (NN) -- namely feed-forward sum-product networks with sparse architecture. The input variables of this network result from the tensorization step, interpreted as a particular featuring step which can also be implemented with a neural network with a specific architecture. We point out interesting parallels to recent results on the expressivity of rectified linear unit (ReLU) networks -- currently one of the most popular type of NNs.
