Linearly-scalable learning of smooth low-dimensional patterns with permutation-aided entropic dimension reduction

Illia Horenko; Lukas Pospisil

Linearly-scalable learning of smooth low-dimensional patterns with permutation-aided entropic dimension reduction

Illia Horenko, Lukas Pospisil

TL;DR

It is shown that when selecting the Euclidean smoothness as a pattern quality criterium, both of these problems can be efficiently solved numerically as one unsupervised entropy-regularized iterative optimization problem.

Abstract

In many data science applications, the objective is to extract appropriately-ordered smooth low-dimensional data patterns from high-dimensional data sets. This is challenging since common sorting algorithms are primarily aiming at finding monotonic orderings in low-dimensional data, whereas typical dimension reduction and feature extraction algorithms are not primarily designed for extracting smooth low-dimensional data patterns. We show that when selecting the Euclidean smoothness as a pattern quality criterium, both of these problems (finding the optimal 'crisp' data permutation and extracting the sparse set of permuted low-dimensional smooth patterns) can be efficiently solved numerically as one unsupervised entropy-regularized iterative optimization problem. We formulate and prove the conditions for monotonicity and convergence of this linearly-scalable (in dimension) numerical procedure, with the iteration cost scaling of $\mathcal{O}(DT^2)$, where $T$ is the size of the data statistics and $D$ is a feature space dimension. The efficacy of the proposed method is demonstrated through the examination of synthetic examples as well as a real-world application involving the identification of smooth bankruptcy risk minimizing transition patterns from high-dimensional economical data. The results showcase that the statistical properties of the overall time complexity of the method exhibit linear scaling in the dimensionality $D$ within the specified confidence intervals.

Linearly-scalable learning of smooth low-dimensional patterns with permutation-aided entropic dimension reduction

TL;DR

Abstract

, where

is the size of the data statistics and

is a feature space dimension. The efficacy of the proposed method is demonstrated through the examination of synthetic examples as well as a real-world application involving the identification of smooth bankruptcy risk minimizing transition patterns from high-dimensional economical data. The results showcase that the statistical properties of the overall time complexity of the method exhibit linear scaling in the dimensionality

within the specified confidence intervals.

Paper Structure (6 sections, 1 theorem, 5 equations, 1 figure, 1 algorithm)

This paper contains 6 sections, 1 theorem, 5 equations, 1 figure, 1 algorithm.

Introduction
Problem formulation
Numerical approach
Results
Synthetic dataset
Taiwanese companies bankruptcy data analysis

Key Result

Theorem 1

Figures (1)

Figure 1: Recovering smooth data ordering for the synthetic data examples (A-F) and for the multidimensional data set on bankruptcy of Taiwanese companies (G-I) taiwan16.

Theorems & Definitions (2)

Theorem 1
proof

Linearly-scalable learning of smooth low-dimensional patterns with permutation-aided entropic dimension reduction

TL;DR

Abstract

Linearly-scalable learning of smooth low-dimensional patterns with permutation-aided entropic dimension reduction

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (2)