Simultaneous Learning and Optimization via Misspecified Saddle Point Problems
Mohammad Mahdi Ahmadi, Erfan Yazdandoost Hamedani
TL;DR
This work tackles misspecified saddle point problems where an unknown parameter $\theta^*$, learned from data, influences the optimization objective. It develops two APD-based algorithms—Naive APD and Learning-aware APD—where the latter explicitly accounts for parameter dynamics and employs a backtracking step-size strategy, achieving a convergence rate of $\mathcal{O}(\log K / K)$ with a smaller constant. The paper further extends the framework to settings with multiple learning solutions, obtaining a $\mathcal{O}(1/\sqrt{K})$ rate under a structured, pessimistic formulation, and validates the approach on a misspecified portfolio problem, showing superior empirical performance. Collectively, these results provide a flexible, data-driven method for jointly solving optimization and learning tasks under parameter misspecification with practical impact on portfolio optimization and robust learning applications.
Abstract
We study a class of misspecified saddle point (SP) problems, where the optimization objective depends on an unknown parameter that must be learned concurrently from data. Unlike existing studies that assume parameters are fully known or pre-estimated, our framework integrates optimization and learning into a unified formulation, enabling a more flexible problem class. To address this setting, we propose two algorithms based on the accelerated primal-dual (APD) by Hamedani & Aybat 2021. In particular, we first analyze the naive extension of the APD method by directly substituting the evolving parameter estimates into the primal-dual updates; then, we design a new learning-aware variant of the APD method that explicitly accounts for parameter dynamics by adjusting the momentum updates. Both methods achieve a provable convergence rate of $\mathcal{O}(\log K / K)$, while the learning-aware approach attains a tighter $\mathcal{O}(1)$ constant and further benefits from an adaptive step-size selection enabled by a backtracking strategy. Furthermore, we extend the framework to problems where the learning problem admits multiple optimal solutions, showing that our modified algorithm for a structured setting achieves an $\mathcal{O}(1/\sqrt{K})$ rate. To demonstrate practical impact, we evaluate our methods on a misspecified portfolio optimization problem and show superior empirical performance compared to state-of-the-art algorithms.
