Invariant debiasing learning for recommendation via biased imputation
Ting Bai, Weijie Chen, Cheng Yang, Chuan Shi
TL;DR
The paper demonstrates that invariant-only debiasing can discard informative variant cues and harm generalization in recommender systems. It introduces KD-Debias, a two-part framework combining disentangled invariant/variant preferences with distance-aware knowledge distillation to transfer biased information into a lightweight unbiased predictor. A distance-based fusion and a teacher–student distillation setup produce unbiased recommendations without requiring unbiased data, achieving state-of-the-art results on Yahoo!R3, Coat, and Mind with substantially fewer parameters. The work highlights the value of leveraging biased variant information to enhance debiasing and offers a practical unsupervised solution for robust recommendations.
Abstract
Previous debiasing studies utilize unbiased data to make supervision of model training. They suffer from the high trial risks and experimental costs to obtain unbiased data. Recent research attempts to use invariant learning to detach the invariant preference of users for unbiased recommendations in an unsupervised way. However, it faces the drawbacks of low model accuracy and unstable prediction performance due to the losing cooperation with variant preference. In this paper, we experimentally demonstrate that invariant learning causes information loss by directly discarding the variant information, which reduces the generalization ability and results in the degradation of model performance in unbiased recommendations. Based on this consideration, we propose a novel lightweight knowledge distillation framework (KDDebias) to automatically learn the unbiased preference of users from both invariant and variant information. Specifically, the variant information is imputed to the invariant user preference in the distance-aware knowledge distillation process. Extensive experiments on three public datasets, i.e., Yahoo!R3, Coat, and MIND, show that with the biased imputation from the variant preference of users, our proposed method achieves significant improvements with less than 50% learning parameters compared to the SOTA unsupervised debiasing model in recommender systems. Our code is publicly available at https://github.com/BAI-LAB/KD-Debias.
