Two-timescale Extragradient for Finding Local Minimax Points
Jiseok Chae, Kyuwon Kim, Donghwan Kim
TL;DR
This paper analyzes minimax optimization in nonconvex-nonconcave settings by proposing a two-timescale extragradient (EG) method and studying its dynamics via dynamical-systems theory. It develops a refined second-order characterization of local minimax points using a restricted Schur complement and introduces hemicurvature to capture spectral behavior when the Hessian in the maximization variable is degenerate. The authors prove that two-timescale EG converges to points satisfying a refined second-order necessary condition, almost surely avoids strict non-minimax points, and, under Minty variational inequality (MVI), globally converges to local minimax points. The results extend prior work by removing nondegeneracy requirements on ${\nabla_{yy}^2 f}$ and provide a unified, dynamical-systems framework for understanding stability and convergence in minimax optimization, with implications for training generative models and adversarial settings.
Abstract
Minimax problems are notoriously challenging to optimize. However, we present that the two-timescale extragradient method can be a viable solution. By utilizing dynamical systems theory, we show that it converges to points that satisfy the second-order necessary condition of local minimax points, under mild conditions that the two-timescale gradient descent ascent fails to work. This work provably improves upon all previous results on finding local minimax points, by eliminating a crucial assumption that the Hessian with respect to the maximization variable is nondegenerate.
