Multi-Aggregator Time-Warping Heterogeneous Graph Neural Network for Personalized Micro-Video Recommendation
Jinkun Han, Wei Li, Zhipeng Cai, Yingshu Li
TL;DR
This work tackles micro-video personalization by introducing MTHGNN, a time-warping, multi-aggregator heterogeneous graph neural network that jointly models social relations, multi-modal content, and sequential, timeliness-driven user preferences. Key innovations include a directed multi-modal heterogeneous graph, relational and attention-based message passing, session-based attention with LSTM, and a graph-free sampling strategy for efficient inference. Empirical results on TikTok and MovieLens demonstrate consistent gains over strong baselines in precision, ranking, and timeliness, with ablations confirming the value of each component. The approach offers a practical pathway to more timely and context-aware micro-video recommendations in real-world systems.
Abstract
Micro-video recommendation is attracting global attention and becoming a popular daily service for people of all ages. Recently, Graph Neural Networks-based micro-video recommendation has displayed performance improvement for many kinds of recommendation tasks. However, the existing works fail to fully consider the characteristics of micro-videos, such as the high timeliness of news nature micro-video recommendation and sequential interactions of frequently changed interests. In this paper, a novel Multi-aggregator Time-warping Heterogeneous Graph Neural Network (MTHGNN) is proposed for personalized news nature micro-video recommendation based on sequential sessions, where characteristics of micro-videos are comprehensively studied, users' preference is mined via multi-aggregator, the temporal and dynamic changes of users' preference are captured, and timeliness is considered. Through the comparison with the state-of-the-arts, the experimental results validate the superiority of our MTHGNN model.
