Statistical and Computational Efficiency for Smooth Tensor Estimation with Unknown Permutations
Chanwoo Lee, Miaoyan Wang
TL;DR
This work develops a permuted smooth tensor framework to denoise high-order tensors with an unknown latent permutation, linking Hölder-smooth function representation to block-wise polynomial approximations on a fixed grid. It reveals a phase transition in recovery governed by the smoothness parameter and tensor order, establishing minimax rates that separate nonparametric and permutation-driven complexity, and showing a sufficiency threshold for polynomial degree. The authors present a statistically optimal yet computationally intensive least-squares estimator, and a scalable Borda-count algorithm that achieves the minimax rate under a monotonicity assumption, with theoretical guarantees and empirical validation on synthetic data and Chicago crime data. The results illuminate fundamental statistical-computational gaps in permuted tensor estimation and provide practical tools, including a public R package, for structured tensor denoising in diverse applications.
Abstract
We consider the problem of structured tensor denoising in the presence of unknown permutations. Such data problems arise commonly in recommendation system, neuroimaging, community detection, and multiway comparison applications. Here, we develop a general family of smooth tensor models up to arbitrary index permutations; the model incorporates the popular tensor block models and Lipschitz hypergraphon models as special cases. We show that a constrained least-squares estimator in the block-wise polynomial family achieves the minimax error bound. A phase transition phenomenon is revealed with respect to the smoothness threshold needed for optimal recovery. In particular, we find that a polynomial of degree up to $(m-2)(m+1)/2$ is sufficient for accurate recovery of order-$m$ tensors, whereas higher degree exhibits no further benefits. This phenomenon reveals the intrinsic distinction for smooth tensor estimation problems with and without unknown permutations. Furthermore, we provide an efficient polynomial-time Borda count algorithm that provably achieves optimal rate under monotonicity assumptions. The efficacy of our procedure is demonstrated through both simulations and Chicago crime data analysis.
