Edge-Colored Clustering in Hypergraphs: Beyond Minimizing Unsatisfied Edges
Alex Crane, Thomas Stanley, Blair D. Sullivan, Nate Veldt
TL;DR
This work advances edge-colored clustering by (i) delivering the first approximation for MaxECC on hypergraphs with factor $(1/(r+1))(2/e)^r$ and refining graph MaxECC to $154/405\approx0.38$ via LP-rounding and color-priority techniques, (ii) introducing generalized $\ell_p$-norm MinECC and fair/balanced variants—Color-Fair MinECC and Protected-Color MinECC—with hardness, approximation, and FPT results, and (iii) validating methods experimentally to show practical gains in fairness-aware ECC while preserving standard clustering quality. The approach blends LP relaxations, randomized rounding, and a careful analysis of independent events and color orderings, connecting ECC to conflict-graph and vertex-cover perspectives. Collectively, the results expand ECC tools for balanced clustering, fairness constraints, and protected-interaction considerations, with implications for team formation, temporal clustering, and other multiway interaction tasks. Overall, the paper highlights both improved theoretical guarantees and practical algorithms that balance accuracy with fairness in edge-colored hypergraph clustering.
Abstract
We consider a framework for clustering edge-colored hypergraphs, where the goal is to cluster (equivalently, to color) objects based on the primary type of multiway interactions they participate in. One well-studied objective is to color nodes to minimize the number of unsatisfied hyperedges -- those containing one or more nodes whose color does not match the hyperedge color. We motivate and present advances for several directions that extend beyond this minimization problem. We first provide new algorithms for maximizing satisfied edges, which is the same at optimality but is much more challenging to approximate, with all prior work restricted to graphs. We develop the first approximation algorithm for hypergraphs, and then refine it to improve the best-known approximation factor for graphs. We then introduce new objective functions that incorporate notions of balance and fairness, and provide new hardness results, approximations, and fixed-parameter tractability results.
