Table of Contents
Fetching ...

Robust Uplift Modeling with Large-Scale Contexts for Real-time Marketing

Zexu Sun, Qiyu Han, Minqin Zhu, Hao Gong, Dugang Liu, Chen Ma

TL;DR

A novel model-agnostic Robust Uplift Modeling with Large-Scale Contexts (UMLC) framework for Real-time Marketing and conducts extensive experiments on a synthetic dataset and a real-world product dataset to verify the effectiveness and compatibility of the UMLC.

Abstract

Improving user engagement and platform revenue is crucial for online marketing platforms. Uplift modeling is proposed to solve this problem, which applies different treatments (e.g., discounts, bonus) to satisfy corresponding users. Despite progress in this field, limitations persist. Firstly, most of them focus on scenarios where only user features exist. However, in real-world scenarios, there are rich contexts available in the online platform (e.g., short videos, news), and the uplift model needs to infer an incentive for each user on the specific item, which is called real-time marketing. Thus, only considering the user features will lead to biased prediction of the responses, which may cause the cumulative error for uplift prediction. Moreover, due to the large-scale contexts, directly concatenating the context features with the user features will cause a severe distribution shift in the treatment and control groups. Secondly, capturing the interaction relationship between the user features and context features can better predict the user response. To solve the above limitations, we propose a novel model-agnostic Robust Uplift Modeling with Large-Scale Contexts (UMLC) framework for Real-time Marketing. Our UMLC includes two customized modules. 1) A response-guided context grouping module for extracting context features information and condensing value space through clusters. 2) A feature interaction module for obtaining better uplift prediction. Specifically, this module contains two parts: a user-context interaction component for better modeling the response; a treatment-feature interaction component for discovering the treatment assignment sensitive feature of each instance to better predict the uplift. Moreover, we conduct extensive experiments on a synthetic dataset and a real-world product dataset to verify the effectiveness and compatibility of our UMLC.

Robust Uplift Modeling with Large-Scale Contexts for Real-time Marketing

TL;DR

A novel model-agnostic Robust Uplift Modeling with Large-Scale Contexts (UMLC) framework for Real-time Marketing and conducts extensive experiments on a synthetic dataset and a real-world product dataset to verify the effectiveness and compatibility of the UMLC.

Abstract

Improving user engagement and platform revenue is crucial for online marketing platforms. Uplift modeling is proposed to solve this problem, which applies different treatments (e.g., discounts, bonus) to satisfy corresponding users. Despite progress in this field, limitations persist. Firstly, most of them focus on scenarios where only user features exist. However, in real-world scenarios, there are rich contexts available in the online platform (e.g., short videos, news), and the uplift model needs to infer an incentive for each user on the specific item, which is called real-time marketing. Thus, only considering the user features will lead to biased prediction of the responses, which may cause the cumulative error for uplift prediction. Moreover, due to the large-scale contexts, directly concatenating the context features with the user features will cause a severe distribution shift in the treatment and control groups. Secondly, capturing the interaction relationship between the user features and context features can better predict the user response. To solve the above limitations, we propose a novel model-agnostic Robust Uplift Modeling with Large-Scale Contexts (UMLC) framework for Real-time Marketing. Our UMLC includes two customized modules. 1) A response-guided context grouping module for extracting context features information and condensing value space through clusters. 2) A feature interaction module for obtaining better uplift prediction. Specifically, this module contains two parts: a user-context interaction component for better modeling the response; a treatment-feature interaction component for discovering the treatment assignment sensitive feature of each instance to better predict the uplift. Moreover, we conduct extensive experiments on a synthetic dataset and a real-world product dataset to verify the effectiveness and compatibility of our UMLC.

Paper Structure

This paper contains 35 sections, 2 theorems, 22 equations, 11 figures, 5 tables, 1 algorithm.

Key Result

Proposition 1

If we can find a predictive function $f$ and transformation function $\xi$ with Lipschitz constraints on contexts such that $|h\left(\boldsymbol{x}^u, \boldsymbol{x}^c, t\right)-f\left(\boldsymbol{x}^u, \xi(\boldsymbol{x}^c), t\right)| \leq \mu, \forall \boldsymbol{x}^u,\boldsymbol{x}^c,t$ and $\lef

Figures (11)

  • Figure 1: An example of distributions of standard RCTs and the RCTs considering the contexts. When considering the uncontrollable contexts in standard RCTs, there may be the significant distribution shift between the treatment and control groups.
  • Figure 2: The overall structure of our UMLC framework. The left is the response-guided context grouping module, the right is the feature interaction module.
  • Figure 3: The visualization of the Production dataset. As $t$ increases, the clarity of the video correspondingly enhances.
  • Figure 4: The context embedding t-SNE visualization of different group number $K$ (2-10) and the sample counts on the trained context embedding of Synthetic dataset. The first is the distribution of the original data.
  • Figure 5: The evaluation of different group number $K$ (2-10). We report the mean over five runs with different seeds.
  • ...and 6 more figures

Theorems & Definitions (2)

  • Proposition 1
  • Proposition 1