Aligned Multi Objective Optimization
Yonathan Efroni, Ben Kretzu, Daniel Jiang, Jalaj Bhandari, Zheqing, Zhu, Karen Ullrich
TL;DR
This work introduces the aligned multi-objective optimization (AMOO) framework for settings where multiple convex objectives share a common minimizer. It develops gradient-descent methods that exploit alignment by adaptively weighting objectives to maximize curvature, yielding faster convergence than naive aggregation. The CAMOO and PAMOO algorithms provide instance-dependent convergence guarantees that scale with curvature measures mu_G and mu_L, and remain robust under approximate alignment via epsilon-AAMOO. Practical implementations leverage diagonal Hessian estimates and Polyak-type step-size ideas to maintain scalability to large models. Overall, AMOO offers a principled route to harness related tasks and reward signals to accelerate learning in practice, with theoretical guarantees and supportive toy experiments.
Abstract
To date, the multi-objective optimization literature has mainly focused on conflicting objectives, studying the Pareto front, or requiring users to balance tradeoffs. Yet, in machine learning practice, there are many scenarios where such conflict does not take place. Recent findings from multi-task learning, reinforcement learning, and LLMs training show that diverse related tasks can enhance performance across objectives simultaneously. Despite this evidence, such phenomenon has not been examined from an optimization perspective. This leads to a lack of generic gradient-based methods that can scale to scenarios with a large number of related objectives. To address this gap, we introduce the Aligned Multi-Objective Optimization framework, propose new algorithms for this setting, and provide theoretical guarantees of their superior performance compared to naive approaches.
