Matrix Editing Meets Fair Clustering: Parameterized Algorithms and Complexity
Robert Ganian, Hung P. Hoang, Simon Wietheger
TL;DR
The paper investigates the complexity of FDMC, the problem of editing a matrix to obtain at most $r$ distinct rows while ensuring every cluster is color-fair. It proves a strong hardness barrier: even the binary case 2-FDMC is W[1]-hard when parameterized by the fairlet size $ ilde{c}$ plus the budget $k$, underscoring the inadequacy of naive parameterizations. It then delineates tractable regimes via three avenues: imposing additional constraints (e.g., $ ilde{c}>k$ or combining $k$ with $r$), fixed-parameter approximation with a $(5 - 3/ ilde{c})$ factor, and an alternative parameterization using treewidth, including a treewidth-based DP for 2-FDMC. Collectively, these results map a rich complexity landscape and provide practical algorithmic strategies for fair matrix editing under constrained resources. The work also outlines potential generalizations to other fairness notions and higher-domain settings, setting a foundation for further theoretical and applied exploration in fair clustering and matrix editing.
Abstract
We study the computational problem of computing a fair means clustering of discrete vectors, which admits an equivalent formulation as editing a colored matrix into one with few distinct color-balanced rows by changing at most $k$ values. While NP-hard in both the fairness-oblivious and the fair settings, the problem is well-known to admit a fixed-parameter algorithm in the former ``vanilla'' setting. As our first contribution, we exclude an analogous algorithm even for highly restricted fair means clustering instances. We then proceed to obtain a full complexity landscape of the problem, and establish tractability results which capture three means of circumventing our obtained lower bound: placing additional constraints on the problem instances, fixed-parameter approximation, or using an alternative parameterization targeting tree-like matrices.
