FairMT: Fairness for Heterogeneous Multi-Task Learning
Guanyu Hu, Tangzheng Lian, Na Yan, Dimitrios Kollias, Xinyu Yang, Oya Celiktutan, Siyang Song, Zeyu Fu
TL;DR
FairMT tackles fairness in multi-task learning with heterogeneous outputs and incomplete supervision by introducing an Asymmetric Heterogeneous Fairness Disparity Aggregation (AHFDA) mechanism and a head-aware optimization proxy. The framework unifies diverse fairness notions across detection, classification, and regression through a directional, worst-case-focused aggregation and a Pareto-guided primal–dual formulation. Empirical results across vision and language benchmarks show substantial fairness gains without sacrificing utility, often surpassing state-of-the-art baselines. The approach also provides scalable optimization via projection-free updates and a head-aware QP proxy, making it practical for real-world, partially labeled MTL problems.
Abstract
Fairness in machine learning has been extensively studied in single-task settings, while fair multi-task learning (MTL), especially with heterogeneous tasks (classification, detection, regression) and partially missing labels, remains largely unexplored. Existing fairness methods are predominantly classification-oriented and fail to extend to continuous outputs, making a unified fairness objective difficult to formulate. Further, existing MTL optimization is structurally misaligned with fairness: constraining only the shared representation, allowing task heads to absorb bias and leading to uncontrolled task-specific disparities. Finally, most work treats fairness as a zero-sum trade-off with utility, enforcing symmetric constraints that achieve parity by degrading well-served groups. We introduce FairMT, a unified fairness-aware MTL framework that accommodates all three task types under incomplete supervision. At its core is an Asymmetric Heterogeneous Fairness Constraint Aggregation mechanism, which consolidates task-dependent asymmetric violations into a unified fairness constraint. Utility and fairness are jointly optimized via a primal--dual formulation, while a head-aware multi-objective optimization proxy provides a tractable descent geometry that explicitly accounts for head-induced anisotropy. Across three homogeneous and heterogeneous MTL benchmarks encompassing diverse modalities and supervision regimes, FairMT consistently achieves substantial fairness gains while maintaining superior task utility. Code will be released upon paper acceptance.
