M-HOF-Opt: Multi-Objective Hierarchical Output Feedback Optimization via Multiplier Induced Loss Landscape Scheduling
Xudong Sun, Nutan Chen, Alexej Gossmann, Matteo Wohlrapp, Yu Xing, Carla Feistner, Emilio Dorigatt, Felix Drost, Daniele Scarcella, Lisa Beer, Carsten Marr
TL;DR
This work addresses the challenge of optimizing multiple loss terms in domain-generalization settings where conventional hyperparameter tuning is impractical. It introduces M-HOF-Opt, a hierarchical, output-feedback framework that jointly adapts model parameters and loss-term multipliers via a probabilistic graphical model with a hypervolume-based objective and a PI-like controller. By decomposing the problem into constrained sub-goals with shrinking reference bounds, it achieves Pareto descent without modifying the inner optimizer and without heavy memory burdens. The method demonstrates robust out-of-domain generalization across multi-term losses (e.g., DIVA on PACS) and reduces sensitivity to controller hyperparameters, offering scalable, resource-efficient multi-objective optimization for deep learning.
Abstract
A probabilistic graphical model is proposed, modeling the joint model parameter and multiplier evolution, with a hypervolume based likelihood, promoting multi-objective descent in structural risk minimization. We address multi-objective model parameter optimization via a surrogate single objective penalty loss with time-varying multipliers, equivalent to online scheduling of loss landscape. The multi-objective descent goal is dispatched hierarchically into a series of constraint optimization sub-problems with shrinking bounds according to Pareto dominance. The bound serves as setpoint for the low-level multiplier controller to schedule loss landscapes via output feedback of each loss term. Our method forms closed loop of model parameter dynamic, circumvents excessive memory requirements and extra computational burden of existing multi-objective deep learning methods, and is robust against controller hyperparameter variation, demonstrated on domain generalization tasks with multi-dimensional regularization losses.
