Vertical Federated Learning Hybrid Local Pre-training
Wenguo Li, Xinling Guo, Xu Jiao, Tiancheng Huang, Xiaoran Yan, Yao Yang
TL;DR
VFLHLP addresses the few-overlap bottleneck in vertical federated learning by performing local pre-training on all participating parties—supervised training for the active party and self-supervised learning for passive parties—and then applying these pre-trained networks to downstream VFL. The approach integrates knowledge transfer to constrain the active partys sub-model and Scarf-based SSL initializations for passive parties during federated fine-tuning, improving performance on real-world tabular advertising datasets. Empirical results on Avazu and Criteo show large gains over vanilla VFL and several baselines, with ablation studies confirming the contribution of each component. The method offers an efficient, scalable solution for leveraging unaligned local data in multi-party VFL deployments with heterogeneous data ownership.
Abstract
Vertical Federated Learning (VFL), which has a broad range of real-world applications, has received much attention in both academia and industry. Enterprises aspire to exploit more valuable features of the same users from diverse departments to boost their model prediction skills. VFL addresses this demand and concurrently secures individual parties from exposing their raw data. However, conventional VFL encounters a bottleneck as it only leverages aligned samples, whose size shrinks with more parties involved, resulting in data scarcity and the waste of unaligned data. To address this problem, we propose a novel VFL Hybrid Local Pre-training (VFLHLP) approach. VFLHLP first pre-trains local networks on the local data of participating parties. Then it utilizes these pre-trained networks to adjust the sub-model for the labeled party or enhance representation learning for other parties during downstream federated learning on aligned data, boosting the performance of federated models. The experimental results on real-world advertising datasets, demonstrate that our approach achieves the best performance over baseline methods by large margins. The ablation study further illustrates the contribution of each technique in VFLHLP to its overall performance.
