BMLP: Behavior-aware MLP for Heterogeneous Sequential Recommendation
Weixin Li, Yuhao Wu, Yang Liu, Weike Pan, Zhong Ming
TL;DR
This work tackles heterogeneous sequential recommendation by modeling multiple user behaviors (e.g., clicks, purchases) with a pure MLP architecture. The proposed BMLP architecture comprises two modules: HIP captures multi-granularity behavior interactions via behavior types and transitions, while PIP adaptively fuses auxiliary behaviors to infer purchase intent; a gating mechanism then merges these signals for final prediction. Empirical results on four public datasets show that BMLP achieves significant improvements over state-of-the-art baselines, with the pure-MLP design offering linear time complexity and practical efficiency. The approach provides a scalable, effective framework for real-world recommender systems that need to leverage diverse user behaviors. Key mathematical relationships include the heterogeneous embedding construction $\mathbf{x}_u^t = \text{concat}(M_u^t, V_{i_u^t})$ with $M_u^t = B_{b_u^t} + \text{trans}(b_u^t, b_u^{t+1})$, and the final prediction combining global and local intents via a gating mechanism, optimized with a cross-entropy-like loss $\mathcal{L} = -\sum_{s}\sum_{j} y_{sj} \log(\hat{r}_{t+1,j}) + (1-y_{sj}) \log(1-\hat{r}_{t+1,j})$, where all mathematical expressions are denoted in $...$.
Abstract
In real recommendation scenarios, users often have different types of behaviors, such as clicking and buying. Existing research methods show that it is possible to capture the heterogeneous interests of users through different types of behaviors. However, most multi-behavior approaches have limitations in learning the relationship between different behaviors. In this paper, we propose a novel multilayer perceptron (MLP)-based heterogeneous sequential recommendation method, namely behavior-aware multilayer perceptron (BMLP). Specifically, it has two main modules, including a heterogeneous interest perception (HIP) module, which models behaviors at multiple granularities through behavior types and transition relationships, and a purchase intent perception (PIP) module, which adaptively fuses subsequences of auxiliary behaviors to capture users' purchase intent. Compared with mainstream sequence models, MLP is competitive in terms of accuracy and has unique advantages in simplicity and efficiency. Extensive experiments show that BMLP achieves significant improvement over state-of-the-art algorithms on four public datasets. In addition, its pure MLP architecture leads to a linear time complexity.
