Fisher Information, Training and Bias in Fourier Regression Models

Lorenzo Pastori; Veronika Eyring; Mierk Schwabe

TL;DR

The paper examines how well metrics based on the Fisher information matrix (FIM), in particular the effective dimension (ED), predict training behavior in the Fourier models that are equivalent to a broad class of quantum neural networks (QNNs). It derives an analytical expression for the FIM of Fourier models and links the ED to the correlation spectrum, which makes it possible to construct models with tunable ED and bias. The study finds an interplay between bias and ED: a higher ED likely helps models that are unbiased towards the target function, while a lower ED likely helps models that are biased towards it. This behavior persists in tensorized Fourier models that scale to larger problem sizes. Overall, the work clarifies how geometric properties and model-task alignment govern trainability in quantum-inspired regression, with tensor networks offering a scalable analysis framework.

Abstract

Motivated by the growing interest in quantum machine learning, in particular quantum neural networks (QNNs), we study how effective recently introduced evaluation metrics based on the Fisher information matrix (FIM) are at predicting their training and prediction performance. We exploit the equivalence between a broad class of QNNs and Fourier models, and study the interplay between the effective dimension and the bias of a model towards a given task, investigating how these affect the model's training and performance. We show that for a model that is completely agnostic, or unbiased, towards the function to be learned, a higher effective dimension likely results in better trainability and performance. On the other hand, for models that are biased towards the function to be learned, a lower effective dimension is likely beneficial during training. To obtain these results, we derive an analytical expression of the FIM for Fourier models and identify the features controlling a model's effective dimension. This allows us to construct models with tunable effective dimension and bias, and to compare their training. We furthermore introduce a tensor network representation of the considered Fourier models, which could be a tool of independent interest for the analysis of QNN models. Overall, these findings provide an explicit example of the interplay between geometrical properties, model-task alignment and training, which is relevant for the broader machine learning community.
