Higher Order Approximation Rates for ReLU CNNs in Korobov Spaces
Yuwen Li, Guozhi Zhang
TL;DR
The paper addresses high-order approximation of Korobov functions in $K_p^{m+1}(\Omega)$ using deep ReLU CNNs, demonstrating that higher-order smoothness yields improved $N^{-m-1}$ convergence (up to a logarithmic factor) with depth $L$ growing only mildly with dimension. It combines sparse-grid interpolation with CNN realizations by representing high-order sparse-grid basis functions through CNN-implementable products, using an approximate multiplication network $\widetilde{\times}_{M,U}$ and a square-approximation via $R_U$. The main result provides an explicit depth bound $L \le C_s d^4 m^3 N \log_2 N$ and an $L_p$-error bound $\|f-f_L\|_{L_p(\Omega)} \le C_{m,d} \|D^{\bm{m}+\bm{1}} f\|_{L_p(\Omega)} N^{-m-1} (\log_2 N)^{(m+2)(d-1)}$, indicating reduced sensitivity to dimensionality compared to Sobolev-based rates. The work suggests that higher-order expressivity of CNNs can be achieved without incurring severe curse-of-dimensionality penalties, motivating further exploration of bit-extraction techniques and related improvements in CNN approximations.
Abstract
This paper investigates the $L_p$ approximation error for higher order Korobov functions using deep convolutional neural networks (CNNs) with ReLU activation. For target functions having a mixed derivative of order m+1 in each direction, we improve classical approximation rate of second order to (m+1)-th order (modulo a logarithmic factor) in terms of the depth of CNNs. The key ingredient in our analysis is approximate representation of high-order sparse grid basis functions by CNNs. The results suggest that higher order expressivity of CNNs does not severely suffer from the curse of dimensionality.
