Estimation of Models with Limited Data by Leveraging Shared Structure

Maryann Rui; Thibaut Horel; Munther Dahleh

Estimation of Models with Limited Data by Leveraging Shared Structure

Maryann Rui, Thibaut Horel, Munther Dahleh

TL;DR

This paper develops a three-step method which estimates the low-dimensional subspace spanned by the systems' parameters and produces refined parameter estimates within the subspace, and provides finite sample subspace estimation error guarantees.

Abstract

Modern data sets, such as those in healthcare and e-commerce, are often derived from many individuals or systems but have insufficient data from each source alone to separately estimate individual, often high-dimensional, model parameters. If there is shared structure among systems however, it may be possible to leverage data from other systems to help estimate individual parameters, which could otherwise be non-identifiable. In this paper, we assume systems share a latent low-dimensional parameter space and propose a method for recovering $d$-dimensional parameters for $N$ different linear systems, even when there are only $T<d$ observations per system. To do so, we develop a three-step algorithm which estimates the low-dimensional subspace spanned by the systems' parameters and produces refined parameter estimates within the subspace. We provide finite sample subspace estimation error guarantees for our proposed method. Finally, we experimentally validate our method on simulations with i.i.d. regression data and as well as correlated time series data.

Estimation of Models with Limited Data by Leveraging Shared Structure

TL;DR

Abstract

-dimensional parameters for

different linear systems, even when there are only

observations per system. To do so, we develop a three-step algorithm which estimates the low-dimensional subspace spanned by the systems' parameters and produces refined parameter estimates within the subspace. We provide finite sample subspace estimation error guarantees for our proposed method. Finally, we experimentally validate our method on simulations with i.i.d. regression data and as well as correlated time series data.

Paper Structure (25 sections, 6 theorems, 49 equations, 5 figures, 1 algorithm)

This paper contains 25 sections, 6 theorems, 49 equations, 5 figures, 1 algorithm.

Introduction
Related Work
Mixtures of linear regressions
Low rank matrix regression
Dictionary learning.
Meta-learning and transfer learning.
Preliminaries
General notations
Subspaces
Random variables
Model
Three-Step Estimator
Initial individual estimates
Normalization.
Comparison with dictionary learning.
...and 10 more sections

Key Result

Theorem 5.1

Let $\hat{\mathcal{B}}$ be the subspace spanned by the columns of the output $\widehat{B}$ in alg:norm with threshold level $s=\Omega(1/(\sqrt d -\sqrt{T-1}))$ if $T\leq d$, and $s =\Omega(1/(\sqrt T -\sqrt d))$ if $T>d$. Then for each $0<\delta<1$, with probability at least $1-\delta$

Figures (5)

Figure 1: Subspace estimation error vs. $N$ for estimates $\widehat{B}$ from (1) thresh and (2) norm. Results are shown for $d=50, r=5$, in regimes $d>T=10$, $T=d=50$, and $d<T=80$.
Figure 2: Comparison of $\beta_i$ estimation error of thresh as compared to oracle and naive estimators. Here, $d = 5, r = 1, T=3$.
Figure 3: Subspace estimation error vs. $N$ for estimates $\widehat{B}$ from (1) norm and (2) MoM, for i.i.d. data. Results are shown for $d=50, r=5$, in regimes $d>T=10$, $T=d=50$, and $d<T=80$..
Figure 4: Subspace estimation error vs. $T$ for estimates $\widehat{B}$ from (1) thresh, (2) norm, and (3) MoM, for i.i.d. data. Results are shown for $d=50, r=5$, for various values of $N$.
Figure 5: Subspace estimation error vs. $N$ for estimates $\widehat{B}$ from (1) norm and (2) MoM, for time-series data. Results are shown for $d=20, r=5$, in regimes $d>T=8$, $T=d=20$, and $d<T=36$.

Theorems & Definitions (12)

Theorem 5.1
Lemma A.1
proof
Lemma A.2
proof
Lemma A.3
proof
Lemma B.1
proof
Lemma B.2
...and 2 more

Estimation of Models with Limited Data by Leveraging Shared Structure

TL;DR

Abstract

Estimation of Models with Limited Data by Leveraging Shared Structure

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (12)