Exploring New Frontiers in Vertical Federated Learning: the Role of Saddle Point Reformulation
Aleksandr Beznosikov, Georgiy Kormakov, Alexander Grigorievskiy, Mikhail Rudakov, Ruslan Nazykov, Alexander Rogozin, Anton Vakhrushev, Andrey Savchenko, Martin Takáč, Alexander Gasnikov
TL;DR
The paper reframes Vertical Federated Learning (VFL) as a convex–concave saddle point problem via the classical Lagrangian, enabling the use of ExtraGradient methods. It develops a deterministic baseline and a rich family of stochastic, communication-efficient extensions (compression, quantization, asynchronous participation, coordinate updates, privacy-preserving noise and encryption) with convergence guarantees. It also explores multiple reformulations (augmented Lagrangian, dual-loss, and variable-augmented forms) and extends the approach to non-convex models, accompanied by numerical experiments that demonstrate competitive performance against traditional minimization-based methods and improvements in privacy and communication efficiency. The results provide a versatile toolbox for scalable, privacy-conscious VFL, applicable to a wide range of real-world settings and data-partitioning schemes.
Abstract
The objective of Vertical Federated Learning (VFL) is to collectively train a model using features available on different devices while sharing the same users. This paper focuses on the saddle point reformulation of the VFL problem via the classical Lagrangian function. We first demonstrate how this formulation can be solved using deterministic methods. More importantly, we explore various stochastic modifications to adapt to practical scenarios, such as employing compression techniques for efficient information transmission, enabling partial participation for asynchronous communication, and utilizing coordinate selection for faster local computation. We show that the saddle point reformulation plays a key role and opens up possibilities to use mentioned extension that seem to be impossible in the standard minimization formulation. Convergence estimates are provided for each algorithm, demonstrating their effectiveness in addressing the VFL problem. Additionally, alternative reformulations are investigated, and numerical experiments are conducted to validate performance and effectiveness of the proposed approach.
