Characterising Bias in Compressed Models
Sara Hooker, Nyalleng Moorosi, Gregory Clark, Samy Bengio, Emily Denton
TL;DR
This work reveals that neural network compression via pruning and quantization preserves aggregate accuracy while disproportionately harming a small subset of instances, exposing or amplifying algorithmic bias. It introduces Compression Identified Exemplars (CIE) as an attribute-agnostic tool to surface challenging cases for human auditing, even when protected attribute labels are scarce. By defining Modal CIE and Taxicab CIE, the authors provide unsupervised means to rank and surface CIEs, showing these exemplars are substantially harder for both compressed and non-compressed systems. The findings argue for integrating CIE-based auditing into deployment pipelines to mitigate fairness risks in resource-constrained settings, with practical implications for domains like hiring, healthcare, and security.
Abstract
The popularity and widespread use of pruning and quantization is driven by the severe resource constraints of deploying deep neural networks to environments with strict latency, memory and energy requirements. These techniques achieve high levels of compression with negligible impact on top-line metrics (top-1 and top-5 accuracy). However, overall accuracy hides disproportionately high errors on a small subset of examples; we call this subset Compression Identified Exemplars (CIE). We further establish that for CIE examples, compression amplifies existing algorithmic bias. Pruning disproportionately impacts performance on underrepresented features, which often coincides with considerations of fairness. Given that CIE is a relatively small subset but a great contributor of error in the model, we propose its use as a human-in-the-loop auditing tool to surface a tractable subset of the dataset for further inspection or annotation by a domain expert. We provide qualitative and quantitative support that CIE surfaces the most challenging examples in the data distribution for human-in-the-loop auditing.
