Transferable and Forecastable User Targeting Foundation Model
Bin Dou, Baokun Wang, Yun Zhu, Xiaotong Lin, Yike Xu, Xiaorui Huang, Yang Chen, Yun Liu, Shaoshuai Han, Yongchao Liu, Tianyi Zhang, Yu Cheng, Weiqiang Wang, Chuntao Hong
TL;DR
FOUND tackles the dual challenges of cross-domain transferability and forecastability in user targeting by integrating heterogeneous multi-scenario data and forecastable user-text pairs through a two-stage self-supervised framework and contrastive alignment with a language model. The method combines time-aware behavioral encoding, tabular and textual encoders, and cross-attention fusion to produce rich, transferable user representations aligned with future-behavior descriptions. Zero-shot transfer and few-shot prompt-tuning enable practical deployment across domains, with extensive real-world results on Alipay benchmarks showing clear gains over strong baselines. The work demonstrates strong cross-domain generalization and practical impact, including deployment in production and notable improvements in metrics such as CTR in various applications.
Abstract
User targeting, the process of selecting targeted users from a pool of candidates for non-expert marketers, has garnered substantial attention with the advancements in digital marketing. However, existing user targeting methods encounter two significant challenges: (i) Poor cross-domain and cross-scenario transferability and generalization, and (ii) Insufficient forecastability in real-world applications. These limitations hinder their applicability across diverse industrial scenarios. In this work, we propose FOUND, an industrial-grade, transferable, and forecastable user targeting foundation model. To enhance cross-domain transferability, our framework integrates heterogeneous multi-scenario user data, aligning them with one-sentence targeting demand inputs through contrastive pre-training. For improved forecastability, the text description of each user is derived based on anticipated future behaviors, while user representations are constructed from historical information. Experimental results demonstrate that our approach significantly outperforms existing baselines in cross-domain, real-world user targeting scenarios, showcasing the superior capabilities of FOUND. Moreover, our method has been successfully deployed on the Alipay platform and is widely utilized across various scenarios.
