RE-SORT: Removing Spurious Correlation in Multilevel Interaction for CTR Prediction
Song-Li Wu, Liang Du, Jia-Qi Yang, Yu-Ai Wang, De-Chuan Zhan, Shuang Zhao, Zi-Xun Sun
TL;DR
RE-SORT tackles spurious correlations in CTR prediction by combining a multilevel stacked recurrent (MSR) structure with a spurious correlation elimination (SCE) module. MSR learns diverse high-order interactions across hierarchical feature spaces via two streams (D-SR and S-SR), while SCE projects features into a high-dimensional Laplacian kernel space and uses random Fourier features to decorrelate nuisance signals from the CTR task, guided by sample reweighting. The authors report state-of-the-art accuracy and speed on four public CTR datasets and a production dataset, with ablation studies supporting the contribution of both MSR and SCE. The approach targets real-world recommender systems by improving generalization and reducing inference time, and the authors state that the code, models, and dataset will be released to facilitate adoption and further research.
Abstract
Click-through rate (CTR) prediction is a critical task in recommendation systems, serving as the final filtering step that sorts items for a user. Most recent methods focus on modeling complex implicit and explicit feature interactions; however, they neglect the spurious correlations caused by confounding factors, which diminishes the model's generalization ability. We propose a CTR prediction framework that REmoves Spurious cORrelations in mulTilevel feature interactions, termed RE-SORT, which has two key components. I. A multilevel stacked recurrent (MSR) structure enables the model to efficiently capture diverse nonlinear interactions from feature spaces at different levels. II. A spurious correlation elimination (SCE) module leverages Laplacian kernel mapping and sample reweighting to eliminate the spurious correlations concealed within the multilevel features, allowing the model to focus on the true causal features. Extensive experiments on four challenging CTR datasets and our production dataset demonstrate that the proposed method achieves state-of-the-art performance in both accuracy and speed. The code, models, and dataset will be released at https://github.com/RE-SORT.
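
To make the SCE idea more concrete, the following is a minimal sketch, not the authors' implementation, of how a Laplacian kernel can be approximated with random Fourier features and how a sample-weighted dependence score between two feature groups might be computed. The function names (`laplacian_rff`, `weighted_dependence`), the bandwidth `gamma`, and the choice of a squared cross-covariance penalty are all assumptions made for illustration; in a reweighting scheme, the sample weights would typically be learned so as to reduce such a score.

```python
import numpy as np


def laplacian_rff(x, n_features, gamma, rng):
    """Random Fourier features approximating k(x, y) = exp(-gamma * ||x - y||_1).

    x has shape (n_samples, n_dims); the result has shape (n_samples, n_features).
    """
    x = np.asarray(x, dtype=float)
    n_dims = x.shape[1]
    # The spectral density of the Laplacian kernel is a product of Cauchy
    # distributions with scale gamma, so frequencies are drawn from it.
    omega = gamma * rng.standard_cauchy(size=(n_dims, n_features))
    phase = rng.uniform(0.0, 2.0 * np.pi, size=n_features)
    return np.sqrt(2.0 / n_features) * np.cos(x @ omega + phase)


def weighted_dependence(za, zb, weights):
    """Squared Frobenius norm of the sample-weighted cross-covariance of za and zb.

    Illustrative dependence score (an assumption, not taken from the paper):
    it is close to zero when the two mapped feature groups are uncorrelated
    under the given sample weights.
    """
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()  # normalize weights to sum to one
    mean_a = w @ za
    mean_b = w @ zb
    cov = (za - mean_a).T @ (w[:, None] * (zb - mean_b))
    return float(np.sum(cov ** 2))


# Example usage with synthetic data (hypothetical, for illustration only).
rng = np.random.default_rng(0)
feature_a = rng.normal(size=(2000, 1))
feature_b = feature_a + 0.1 * rng.normal(size=(2000, 1))  # depends on feature_a
uniform_weights = np.ones(2000)
score = weighted_dependence(
    laplacian_rff(feature_a, 32, 1.0, rng),
    laplacian_rff(feature_b, 32, 1.0, rng),
    uniform_weights,
)
```

Mapping each feature group through the kernel features before measuring covariance lets the score pick up nonlinear dependence, which a plain linear covariance on the raw features could miss.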
