Table of Contents
Fetching ...

Online Learning in Semiparametric Econometric Models

Xiaohong Chen, Elie Tamer, Qingsong Yao

TL;DR

An online learning framework for semiparametric monotone index models with an unknown monotone link function is developed, using a two-phase learning paradigm that yields consistent estimation from arbitrary initialization.

Abstract

Data in modern economic and financial applications often arrive as a stream, requiring models and inference to be updated in real time -- yet most semiparametric methods remain batch-based and computationally impractical in large-scale streaming settings. We develop an online learning framework for semiparametric monotone index models with an unknown monotone link function. Our approach uses a two-phase learning paradigm. In a warm-start phase, we introduce a new online algorithm for the finite-dimensional parameter that is globally stable, yielding consistent estimation from arbitrary initialization. In a subsequent rate-optimal phase, we update the finite-dimensional parameter using an orthogonalized score while learning the unknown link via an online sieve method; this phase achieves optimal convergence rates for both components. The procedure processes only the most recent data batch, making it suitable when data cannot be stored (e.g., memory, privacy, or security constraints), and its resulting parameter trajectories enable online inference such as confidence regions--on parameters including policy-effect analysis with negligible additional computation. Monte Carlo experiments on both simulated and real data show adequate performance especially relative to full sample methods.

Online Learning in Semiparametric Econometric Models

TL;DR

An online learning framework for semiparametric monotone index models with an unknown monotone link function is developed, using a two-phase learning paradigm that yields consistent estimation from arbitrary initialization.

Abstract

Data in modern economic and financial applications often arrive as a stream, requiring models and inference to be updated in real time -- yet most semiparametric methods remain batch-based and computationally impractical in large-scale streaming settings. We develop an online learning framework for semiparametric monotone index models with an unknown monotone link function. Our approach uses a two-phase learning paradigm. In a warm-start phase, we introduce a new online algorithm for the finite-dimensional parameter that is globally stable, yielding consistent estimation from arbitrary initialization. In a subsequent rate-optimal phase, we update the finite-dimensional parameter using an orthogonalized score while learning the unknown link via an online sieve method; this phase achieves optimal convergence rates for both components. The procedure processes only the most recent data batch, making it suitable when data cannot be stored (e.g., memory, privacy, or security constraints), and its resulting parameter trajectories enable online inference such as confidence regions--on parameters including policy-effect analysis with negligible additional computation. Monte Carlo experiments on both simulated and real data show adequate performance especially relative to full sample methods.
Paper Structure (23 sections, 16 theorems, 322 equations, 2 figures, 7 tables, 2 algorithms)

This paper contains 23 sections, 16 theorems, 322 equations, 2 figures, 7 tables, 2 algorithms.

Key Result

Lemma 1

Let condition1 and condition2 hold, for any choice of $\{h_k\}_{k=1}^{\infty}$ with $h_k\downarrow 0$, we have that

Figures (2)

  • Figure 1: Empirical Results: Online Learning Updates and PR Averages
  • Figure 2: Empirical Results: Confidence Band based on Random Scaling

Theorems & Definitions (36)

  • Remark 1
  • Lemma 1
  • Remark 2: Contraction Intuition
  • Remark 3
  • Lemma 2
  • Theorem 1
  • Theorem 2
  • Theorem 3
  • Remark 4
  • Theorem 4
  • ...and 26 more