Adaptive multi-gradient methods for quasiconvex vector optimization and applications to multi-task learning
Nguyen Anh Minh, Le Dung Muu, Tran Ngoc Thang
TL;DR
The paper tackles nonconvex, multiobjective optimization on unbounded feasible sets by introducing a line-search-free adaptive step-size scheme that yields a monotone multi-gradient descent via a KKT-derived direction $s(x)$ and objective decrease $\Theta(x)=-\tfrac12\|s(x)\|^2$. It provides convergence guarantees under mild assumptions and extends the framework with preference vectors to achieve a more uniform distribution of Pareto points across the front, all while maintaining scalable performance for large-scale multi-task learning problems. Empirical results on synthetic MOPs and real-world MTL tasks (e.g., Multi-MNIST, Drug Review) show faster convergence and improved front coverage compared with PMTL, validating the practical impact of the approach. The proposed line-search-free, adaptive scheme reduces per-iteration cost and offers a flexible, principled method for simultaneously optimizing multiple tasks with diverse trade-offs.
Abstract
We present an adaptive step-size method, which does not include line-search techniques, for solving a wide class of nonconvex multiobjective programming problems on an unbounded constraint set. We also prove convergence of a general approach under modest assumptions. More specifically, the convexity criterion might not be satisfied by the objective function. Unlike descent line-search algorithms, it does not require an initial step-size to be determined by a previously determined Lipschitz constant. The process's primary characteristic is its gradual step-size reduction up until a predetermined condition is met. It can be specifically applied to offer an innovative multi-gradient projection method for unbounded constrained optimization issues. Preliminary findings from a few computational examples confirm the accuracy of the strategy. We apply the proposed technique to some multi-task learning experiments to show its efficacy for large-scale challenges.
