CSMF: Cascaded Selective Mask Fine-Tuning for Multi-Objective Embedding-Based Retrieval

Hao Deng; Haibo Xing; Kanefumi Matsuyama; Moyu Zhang; Jinxin Hu; Hong Wen; Yu Zhang; Xiaoyi Zeng; Jing Zhang

CSMF: Cascaded Selective Mask Fine-Tuning for Multi-Objective Embedding-Based Retrieval

Hao Deng, Haibo Xing, Kanefumi Matsuyama, Moyu Zhang, Jinxin Hu, Hong Wen, Yu Zhang, Xiaoyi Zeng, Jing Zhang

TL;DR

CSMF tackles multi-objective embedding-based retrieval by introducing cascaded selective mask fine-tuning that allocates independent learning space for each objective without adding parameters. It combines exposure pre-training with sequential fine-tuning on click and conversion tasks, augmented by CPP pruning and AML loss to mitigate gradient conflicts and forgetting. The framework yields a linearly fused multi-objective score without increasing final vector dimensions, enabling efficient online serving and adaptable objective weighting. Real-world experiments show consistent offline gains on industrial and AliExpress datasets and positive online A/B results in RPM, CTR, and CVR, underscoring practical impact for multi-objective EBR systems.

Abstract

Multi-objective embedding-based retrieval (EBR) has become increasingly critical due to the growing complexity of user behaviors and commercial objectives. While traditional approaches often suffer from data sparsity and limited information sharing between objectives, recent methods utilizing a shared network alongside dedicated sub-networks for each objective partially address these limitations. However, such methods significantly increase the model parameters, leading to an increased retrieval latency and a limited ability to model causal relationships between objectives. To address these challenges, we propose the Cascaded Selective Mask Fine-Tuning (CSMF), a novel method that enhances both retrieval efficiency and serving performance for multi-objective EBR. The CSMF framework selectively masks model parameters to free up independent learning space for each objective, leveraging the cascading relationships between objectives during the sequential fine-tuning. Without increasing network parameters or online retrieval overhead, CSMF computes a linearly weighted fusion score for multiple objective probabilities while supporting flexible adjustment of each objective's weight across various recommendation scenarios. Experimental results on real-world datasets demonstrate the superior performance of CSMF, and online experiments validate its significant practical value.

CSMF: Cascaded Selective Mask Fine-Tuning for Multi-Objective Embedding-Based Retrieval

TL;DR

Abstract

CSMF: Cascaded Selective Mask Fine-Tuning for Multi-Objective Embedding-Based Retrieval

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (5)