Label Noise Robustness for Domain-Agnostic Fair Corrections via Nearest Neighbors Label Spreading
Nathan Stromberg, Rohan Ayyagari, Sanmi Koyejo, Richard Nock, Lalitha Sankar
TL;DR
This work tackles the problem of maximizing worst-group accuracy under symmetric label noise by making last-layer fairness corrections domain-agnostic. It introduces a plug-in preprocessing step: kNN label spreading in the latent embedding space to denoise labels, followed by existing two-stage last-layer corrections (RAD or SELF). The approach demonstrates state-of-the-art worst-group accuracy across several datasets under varying noise levels while adding minimal computational overhead. Key insights include the importance of embedding separability, the need to adapt the neighbor count to noise level, and the potential to extend domain-agnostic fairness corrections without domain annotations. Overall, the method offers a practical, scalable route to robust subgroup fairness in the presence of label noise.
Abstract
Last-layer retraining methods have emerged as an efficient framework for correcting existing base models. Within this framework, several methods have been proposed to deal with correcting models for subgroup fairness with and without group membership information. Importantly, prior work has demonstrated that many methods are susceptible to noisy labels. To this end, we propose a drop-in correction for label noise in last-layer retraining, and demonstrate that it achieves state-of-the-art worst-group accuracy for a broad range of symmetric label noise and across a wide variety of datasets exhibiting spurious correlations. Our proposed approach uses label spreading on a latent nearest neighbors graph and has minimal computational overhead compared to existing methods.
