End-to-End Cost-Effective Incentive Recommendation under Budget Constraint with Uplift Modeling
Zexu Sun, Hao Yang, Dugang Liu, Yunpeng Weng, Xing Tang, Xiuqiang He
TL;DR
This work tackles end-to-end budget-constrained incentive allocation by reframing uplift modeling as a cost-aware optimization problem. It introduces E3IR, a two-module architecture with a monotonic and smooth uplift prediction component and a differentiable ILP-based allocation layer that allows gradients to be backpropagated through the optimization process. By aligning uplift prediction with the budget-constrained decision task in a joint training objective, E3IR reduces the suboptimality gap inherent in two-stage pipelines and improves ROI-focused incentive recommendations in both binary and multi-treatment settings. Experiments on public and production datasets show improved performance over two-stage baselines on uplift and budget-allocation metrics, supporting its use in cost-effective online marketing campaigns.
Abstract
In modern online platforms, incentives are essential for enhancing user engagement and increasing platform revenue. In recent years, uplift modeling has been introduced as a strategic approach to assigning incentives to individual customers. In many real-world applications, however, online platforms can only incentivize customers within a specific budget. This problem can be formulated as a multiple-choice knapsack problem, in which the goal is to select the optimal incentive for each customer so as to maximize the return on investment. Recent works in this field frequently tackle the budget allocation problem with a two-stage approach that first predicts uplift and then allocates the budget. However, this solution faces two challenges: (1) Causal inference methods often ignore domain knowledge in online marketing, where a customer's expected response curve should be monotonic and smooth as the incentive increases. (2) The optimality gap between the two stages leads to sub-optimal allocation performance, because the uplift prediction is trained without the incentive recommendation information under the limited budget constraint. To address these challenges, we propose a novel End-to-End Cost-Effective Incentive Recommendation (E3IR) model under budget constraints. Specifically, our method consists of two modules: an uplift prediction module and a differentiable allocation module. In the uplift prediction module, we construct prediction heads that capture the incremental improvement between adjacent treatments under marketing domain constraints (i.e., monotonicity and smoothness). In the allocation module, we incorporate integer linear programming (ILP) as a differentiable layer. Furthermore, we conduct extensive experiments on public and real-world product datasets, demonstrating that E3IR improves allocation performance compared to existing two-stage approaches.
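The two ideas in the abstract can be illustrated with a minimal Python sketch; it is not the authors' implementation. It assumes that monotonicity is obtained by summing non-negative increments between adjacent treatments (softplus is an illustrative choice), and it solves the multiple-choice knapsack problem by brute force as a stand-in for the differentiable ILP layer. The smoothness constraint and end-to-end training are omitted. The names `monotonic_curve`, `allocate`, and all numeric values are hypothetical.

```python
import math
from itertools import product


def softplus(x):
    # Numerically stable softplus; the result is always positive.
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def monotonic_curve(base, raw_increments):
    """Response for treatments 0..K: base plus cumulative non-negative increments.

    Each raw value stands for the output of one (hypothetical) prediction head
    that models the improvement between two adjacent treatments.
    """
    curve = [base]
    for raw in raw_increments:
        curve.append(curve[-1] + softplus(raw))
    return curve


def allocate(uplifts, costs, budget):
    """Brute-force multiple-choice knapsack: exactly one treatment per customer.

    uplifts[i][k] is the uplift of customer i under treatment k and costs[k]
    is the cost of treatment k. Exponential in the number of customers, so it
    is only suitable for tiny illustrative inputs.
    """
    best_value, best_choice = float("-inf"), None
    for choice in product(range(len(costs)), repeat=len(uplifts)):
        if sum(costs[k] for k in choice) > budget:
            continue  # this assignment violates the budget constraint
        value = sum(uplifts[i][k] for i, k in enumerate(choice))
        if value > best_value:
            best_value, best_choice = value, choice
    return best_choice, best_value


# Hypothetical raw head outputs for two customers and three incentive levels
# (treatment 0 means no incentive).
curves = [monotonic_curve(0.10, [1.0, -1.0]), monotonic_curve(0.05, [-0.5, 0.5])]
# Uplift of treatment k is the response gain over the no-incentive baseline.
uplifts = [[r - curve[0] for r in curve] for curve in curves]
choice, value = allocate(uplifts, costs=[0, 1, 2], budget=2)
print(choice, round(value, 3))
```

In a two-stage pipeline, the curves would be fitted first and `allocate` would be run afterwards on the fixed predictions; the abstract's point is that E3IR instead trains the prediction module through the allocation step.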
