Dual Test-time Training for Out-of-distribution Recommender System

Xihong Yang; Yiqi Wang; Jin Chen; Wenqi Fan; Xiangyu Zhao; En Zhu; Xinwang Liu; Defu Lian

Dual Test-time Training for Out-of-distribution Recommender System

Xihong Yang, Yiqi Wang, Jin Chen, Wenqi Fan, Xiangyu Zhao, En Zhu, Xinwang Liu, Defu Lian

TL;DR

This paper proposes a novel Dual Test-Time-Training framework for OOD Recommendation, termed DT3OR, which is the first work to address OOD recommendation via a test-time-training strategy and conducts experiments on five datasets with various backbones.

Abstract

Deep learning has been widely applied in recommender systems, which has achieved revolutionary progress recently. However, most existing learning-based methods assume that the user and item distributions remain unchanged between the training phase and the test phase. However, the distribution of user and item features can naturally shift in real-world scenarios, potentially resulting in a substantial decrease in recommendation performance. This phenomenon can be formulated as an Out-Of-Distribution (OOD) recommendation problem. To address this challenge, we propose a novel Dual Test-Time-Training framework for OOD Recommendation, termed DT3OR. In DT3OR, we incorporate a model adaptation mechanism during the test-time phase to carefully update the recommendation model, allowing the model to specially adapt to the shifting user and item features. To be specific, we propose a self-distillation task and a contrastive task to assist the model learning both the user's invariant interest preferences and the variant user/item characteristics during the test-time phase, thus facilitating a smooth adaptation to the shifting features. Furthermore, we provide theoretical analysis to support the rationale behind our dual test-time training framework. To the best of our knowledge, this paper is the first work to address OOD recommendation via a test-time-training strategy. We conduct experiments on three datasets with various backbones. Comprehensive experimental results have demonstrated the effectiveness of DT3OR compared to other state-of-the-art baselines.

Dual Test-time Training for Out-of-distribution Recommender System

TL;DR

Abstract

Paper Structure (34 sections, 2 theorems, 35 equations, 13 figures, 5 tables, 1 algorithm)

This paper contains 34 sections, 2 theorems, 35 equations, 13 figures, 5 tables, 1 algorithm.

Introduction
Problem Definition
Preliminary
Methodology
Self-supervised Learning Tasks Design
Self-distillation Task Design
Contrastive Task Design
Objective function
Theoretical analysis
Experiments
Datasets & Metric
Benchmark Datasets
Evaluation Metrics
Experiment Settings
Implementation Details
...and 19 more sections

Key Result

Theorem 1

$\mathcal{L}(u,i,y;\theta)$ is convex and $\beta$-smooth with respect to ${\theta}$, and the magnitude of the gradient $|\nabla{\bm \theta}\mathcal{L}(u,i,y;\theta) |$ is bounded by a positive constant $B^{'}$ for all ${\bm \theta}$.

Figures (13)

Figure 1: Illustration of OOD recommendation. The distribution shift of user and item features in test phase, leading to decline in recommendation performance.
Figure 2: Illustration of user/item feature shift.
Figure 3: The overall framework for our proposed DT3OR.
Figure 4: Illustration of the self-distillation task designing. In our proposed method, we initially obtain the user embedding $\textbf{E}_u$ and item embedding $\textbf{E}_i$ by encoding the user/item features and historical interactions with the training network. After this, we carry out K-means on the combined embedding to yield the clustering results. Subsequently, with the sharpen function, we implement self-distillation to enhance the user interest centers, thus improving the generalization of model to deal with shift features.
Figure 5: Illustration of the contrastive task designing. With the high confidence results of clustering, we select the samples in the same interest center as positive sample, while regarding the different high confidence interest centers as negative samples.
...and 8 more figures

Theorems & Definitions (5)

Definition 1: Test-time training for OOD Recommendation (T3OR).
Theorem 1
proof
Theorem 2
proof

Dual Test-time Training for Out-of-distribution Recommender System

TL;DR

Abstract

Dual Test-time Training for Out-of-distribution Recommender System

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (13)

Theorems & Definitions (5)