PMGDA: A Preference-based Multiple Gradient Descent Algorithm
Xiaoyuan Zhang, Xi Lin, Qingfu Zhang
TL;DR
This work tackles the challenge of finding Pareto solutions in large-scale multi-objective optimization that align with a decision-maker's preferences. It introduces PMGDA, a predict-and-correct framework built on MGDA that optimizes both the objective vector and a user-specified constraint $h(\bm{\theta})$, enabling exact Pareto and ROI-focused solutions. The method leverages dual formulations for efficient prediction and a small dual LP for correction, achieving scalability to neural networks with thousands of parameters and adapting to MORL tasks. Empirical results on synthetic benchmarks, fairness multitask learning, and MORL demonstrate superior precision, ROI satisfaction, and faster convergence compared with prior approaches, with code to be released. The framework thus offers a practical, flexible pathway for preference-guided multi-objective learning in real-world, large-scale settings.
Abstract
It is desirable in many multi-objective machine learning applications, such as multi-task learning with conflicting objectives and multi-objective reinforcement learning, to find a Pareto solution that can match a given preference of a decision maker. These problems are often large-scale with available gradient information but cannot be handled very well by the existing algorithms. To tackle this critical issue, this paper proposes a novel predict-and-correct framework for locating a Pareto solution that fits the preference of a decision maker. In the proposed framework, a constraint function is introduced in the search progress to align the solution with a user-specific preference, which can be optimized simultaneously with multiple objective functions. Experimental results show that our proposed method can efficiently find a particular Pareto solution under the demand of a decision maker for standard multiobjective benchmark, multi-task learning, and multi-objective reinforcement learning problems with more than thousands of decision variables. Code is available at: https://github.com/xzhang2523/pmgda. Our code is current provided in the pgmda.rar attached file and will be open-sourced after publication.}
