Table of Contents
Fetching ...

Doubly Calibrated Estimator for Recommendation on Data Missing Not At Random

Wonbin Kweon, Hwanjo Yu

TL;DR

A Doubly Calibrated Estimator that involves the calibration of both the imputation and propensity models and a tri-level joint learning framework allowing the simultaneous optimization of calibration experts alongside prediction and imputation models is proposed.

Abstract

Recommender systems often suffer from selection bias as users tend to rate their preferred items. The datasets collected under such conditions exhibit entries missing not at random and thus are not randomized-controlled trials representing the target population. To address this challenge, a doubly robust estimator and its enhanced variants have been proposed as they ensure unbiasedness when accurate imputed errors or predicted propensities are provided. However, we argue that existing estimators rely on miscalibrated imputed errors and propensity scores as they depend on rudimentary models for estimation. We provide theoretical insights into how miscalibrated imputation and propensity models may limit the effectiveness of doubly robust estimators and validate our theorems using real-world datasets. On this basis, we propose a Doubly Calibrated Estimator that involves the calibration of both the imputation and propensity models. To achieve this, we introduce calibration experts that consider different logit distributions across users. Moreover, we devise a tri-level joint learning framework, allowing the simultaneous optimization of calibration experts alongside prediction and imputation models. Through extensive experiments on real-world datasets, we demonstrate the superiority of the Doubly Calibrated Estimator in the context of debiased recommendation tasks.

Doubly Calibrated Estimator for Recommendation on Data Missing Not At Random

TL;DR

A Doubly Calibrated Estimator that involves the calibration of both the imputation and propensity models and a tri-level joint learning framework allowing the simultaneous optimization of calibration experts alongside prediction and imputation models is proposed.

Abstract

Recommender systems often suffer from selection bias as users tend to rate their preferred items. The datasets collected under such conditions exhibit entries missing not at random and thus are not randomized-controlled trials representing the target population. To address this challenge, a doubly robust estimator and its enhanced variants have been proposed as they ensure unbiasedness when accurate imputed errors or predicted propensities are provided. However, we argue that existing estimators rely on miscalibrated imputed errors and propensity scores as they depend on rudimentary models for estimation. We provide theoretical insights into how miscalibrated imputation and propensity models may limit the effectiveness of doubly robust estimators and validate our theorems using real-world datasets. On this basis, we propose a Doubly Calibrated Estimator that involves the calibration of both the imputation and propensity models. To achieve this, we introduce calibration experts that consider different logit distributions across users. Moreover, we devise a tri-level joint learning framework, allowing the simultaneous optimization of calibration experts alongside prediction and imputation models. Through extensive experiments on real-world datasets, we demonstrate the superiority of the Doubly Calibrated Estimator in the context of debiased recommendation tasks.
Paper Structure (28 sections, 7 theorems, 23 equations, 3 figures, 3 tables, 1 algorithm)

This paper contains 28 sections, 7 theorems, 23 equations, 3 figures, 3 tables, 1 algorithm.

Key Result

Lemma 1

The bias and variance of DR estimator are computed as:

Figures (3)

  • Figure 1: Performance of DR estimators with various imputation/propensity models on Yahoo!R3 dataset.
  • Figure 2: Ablation study on Yahoo!R3.
  • Figure 3: Hyper-parameter study on number of experts ($K$).

Theorems & Definitions (7)

  • Lemma 1: Bias and Variance of DR estimator
  • Theorem 2
  • Corollary 3
  • Theorem 4
  • Theorem 2
  • Corollary 3
  • Theorem 4