A Symplectic Analysis of Alternating Mirror Descent
Jonas Katona, Xiuyuan Wang, Andre Wibisono
TL;DR
The paper investigates Alternating Mirror Descent (AMD) for bilinear zero-sum games through the lens of Hamiltonian dynamics by discretizing the continuous-time flow with the symplectic Euler method. It develops a rigorous framework using Lie algebras and the Baker-Campbell-Hausdorff (BCH) expansion to define a modified Hamiltonian $\widetilde{H}_{\eta}$ that is (formally) conserved by the discretization and that interpolates the AMD trajectory. Key contributions include a closed-form expression for $\widetilde{H}_{\eta}$ in the quadratic case, improved regret bounds $\mathcal{O}(K^{1/5})$ and duality-gap decay $\mathcal{O}(K^{-4/5})$ for AMD, and a conjecture suggesting further improvements to $\mathcal{O}(K^{\varepsilon})$ and $\mathcal{O}(K^{-1+\varepsilon})$ for any $\varepsilon>0$, with absolute convergence yielding $\mathcal{O}(1)$ regret and $\mathcal{O}(K^{-1})$ duality. The work also connects AMD to symplectic integrators via a symmetric decomposition of the payoff matrix, providing a principled energy-based analysis of AMD and proposing avenues for leveraging higher-order symplectic schemes in game-theoretic learning. Overall, the paper bridges symplectic numerical analysis and algorithmic game theory, offering geometric insights and concrete algorithmic guarantees for AMD.
Abstract
Motivated by understanding the behavior of the Alternating Mirror Descent (AMD) algorithm for bilinear zero-sum games, we study the discretization of continuous-time Hamiltonian flow via the symplectic Euler method. We provide a framework for analysis using results from Hamiltonian dynamics, Lie algebra, and symplectic numerical integrators, with an emphasis on the existence and properties of a conserved quantity, the modified Hamiltonian (MH), for the symplectic Euler method. We compute the MH in closed-form when the original Hamiltonian is a quadratic function, and show that it generally differs from the other conserved quantity known previously in that case. We derive new error bounds on the MH when truncated at orders in the stepsize in terms of the number of iterations, $K$, and use these bounds to show an improved $\mathcal{O}(K^{1/5})$ total regret bound and an $\mathcal{O}(K^{-4/5})$ duality gap of the average iterates for AMD. Finally, we propose a conjecture which, if true, would imply that the total regret for AMD scales as $\mathcal{O}\left(K^{\varepsilon}\right)$ and the duality gap of the average iterates as $\mathcal{O}\left(K^{-1+\varepsilon}\right)$ for any $\varepsilon>0$, and we can take $\varepsilon=0$ upon certain convergence conditions for the MH.
