Fair Set Cover
Mohsen Dehghankar, Rahul Raychaudhury, Stavros Sintos, Abolfazl Asudeh
TL;DR
The paper introduces Fair Set Cover (FSC) and its weighted variant, aiming to minimize the number of sets while enforcing demographic parity across groups. It develops naive, greedy, and fast LP-based randomized algorithms that achieve zero unfairness under certain assumptions and offer competitive approximation guarantees compared to classic Set Cover, with only modest increases in running time. The authors extend the framework to generalized and relaxed fairness notions (GFSC, GFWSC, ε-FGFSC) and demonstrate strong empirical results across real and synthetic datasets, showing fairness can be attained with minimal impact on output size and computation time. The work highlights FSC’s applicability to diverse societal problems, including team formation, licensing, and fair clustering, and situates it within the broader literature on algorithmic fairness in combinatorial optimization.
Abstract
The potential harms of algorithmic decisions have ignited algorithmic fairness as a central topic in computer science. One of the fundamental problems in computer science is Set Cover, which has numerous applications with societal impacts, such as assembling a small team of individuals that collectively satisfy a range of expertise requirements. However, despite its broad application spectrum and significant potential impact, set cover has yet to be studied through the lens of fairness. Therefore, in this paper, we introduce Fair Set Cover, which aims not only to cover with a minimum-size set but also to satisfy demographic parity in its selection of sets. To this end, we develop multiple versions of fair set cover, study their hardness, and devise efficient approximation algorithms for each variant. Notably, under certain assumptions, our algorithms always guarantee zero-unfairness, with only a small increase in the approximation ratio compared to regular set cover. Furthermore, our experiments on various data sets and across different settings confirm the negligible price of fairness, as (a) the output size increases only slightly (if any) and (b) the time to compute the output does not significantly increase.
