BLADE: A Behavior-Level Data Augmentation Framework with Dual Fusion Modeling for Multi-Behavior Sequential Recommendation
Yupeng Li, Mingyue Cheng, Yucong Luo, Yitong Zhou, Qingyang Mao, Shijin Wang
TL;DR
The paper addresses multi-behavior sequential recommendation by tackling behavior heterogeneity and data sparsity. It introduces BLADE, a framework that combines dual item-behavior fusion (early and intermediate) with behavior-level data augmentation and a sequence-level contrastive loss. Key contributions include a dual fusion architecture, three behavior-level augmentation techniques (co-occurrence addition, frequency masking, auxiliary flipping), and a behavior-richness loss weighting, all validated on three real-world datasets where BLADE consistently outperforms baselines. The results demonstrate improved modeling of complex user preferences and enhanced robustness to long-tail behaviors, with practical implications for real-world recommendation systems.
Abstract
Multi-behavior sequential recommendation aims to capture users' dynamic interests by modeling diverse types of user interactions over time. Although several studies have explored this setting, the recommendation performance remains suboptimal, mainly due to two fundamental challenges: the heterogeneity of user behaviors and data sparsity. To address these challenges, we propose BLADE, a framework that enhances multi-behavior modeling while mitigating data sparsity. Specifically, to handle behavior heterogeneity, we introduce a dual item-behavior fusion architecture that incorporates behavior information at both the input and intermediate levels, enabling preference modeling from multiple perspectives. To mitigate data sparsity, we design three behavior-level data augmentation methods that operate directly on behavior sequences rather than core item sequences. These methods generate diverse augmented views while preserving the semantic consistency of item sequences. These augmented views further enhance representation learning and generalization via contrastive learning. Experiments on three real-world datasets demonstrate the effectiveness of our approach.
