Tensor networks and efficient descriptions of classical data

Sirui Lu; Márton Kanász-Nagy; Ivan Kukuljan; J. Ignacio Cirac

TL;DR

This work investigates the potential of tensor-network-based machine-learning methods to scale to large image and text datasets, and introduces two models that reproduce the observed mutual-information scaling: a quantum-inspired random pair toy model and a linguistically motivated Markovian dependency tree model.

Abstract

We investigate the potential of tensor-network-based machine learning methods to scale to large image and text data sets. To this end, we study how the mutual information between a subregion and its complement scales with the subsystem size $L$, as is done in quantum many-body physics. We find that for text, the mutual information scales as a power law $L^\nu$ with an exponent close to a volume law, indicating that text cannot be efficiently described by 1D tensor networks. For images, the scaling is close to an area law, hinting that 2D tensor networks such as PEPS could have adequate expressibility. For the numerical analysis, we introduce a mutual information estimator based on autoregressive networks, and we also use convolutional neural networks in a neural estimator method.
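As an illustration of the scaling analysis described in the abstract, the following is a minimal sketch, not the estimators used in the paper. It computes the mutual information of a bipartition directly from a small joint probability table and fits the exponent $\nu$ of a power law $I(L) \approx c\,L^\nu$ by least squares in log-log space. The function names `mutual_information` and `fit_power_law_exponent`, the prefactor `c`, and the synthetic data are assumptions introduced here for illustration only.

```python
import numpy as np


def mutual_information(joint):
    """Mutual information I(A:B) in nats from a joint probability table p(a, b).

    Hypothetical helper: rows index outcomes of region A, columns those of B.
    """
    joint = np.asarray(joint, dtype=float)
    joint = joint / joint.sum()              # normalise to a probability table
    pa = joint.sum(axis=1, keepdims=True)    # marginal p(a), shape (n, 1)
    pb = joint.sum(axis=0, keepdims=True)    # marginal p(b), shape (1, m)
    product = pa @ pb                        # p(a) p(b), shape (n, m)
    mask = joint > 0                         # skip zero entries (0 log 0 = 0)
    return float(np.sum(joint[mask] * np.log(joint[mask] / product[mask])))


def fit_power_law_exponent(sizes, mi_values):
    """Fit log I = nu * log L + log c by least squares; return (nu, c)."""
    nu, log_c = np.polyfit(np.log(sizes), np.log(mi_values), deg=1)
    return float(nu), float(np.exp(log_c))


# Two perfectly correlated bits share log(2) nats; independent bits share none.
mi_correlated = mutual_information([[0.5, 0.0], [0.0, 0.5]])
mi_independent = mutual_information([[0.25, 0.25], [0.25, 0.25]])

# Synthetic (made-up) mutual-information values following I = 0.5 * L**0.9.
sizes = np.array([2, 4, 8, 16, 32, 64])
mi_values = 0.5 * sizes ** 0.9
nu, c = fit_power_law_exponent(sizes, mi_values)
```

For a one-dimensional sequence, a fitted $\nu$ near 1 corresponds to volume-law growth, while $\nu$ near 0 corresponds to an area law. The exact table-based computation above is only feasible for tiny systems, which is why estimators are needed for real image and text data.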
