Diverse Prototypical Ensembles Improve Robustness to Subpopulation Shift

Minh Nguyen Nhat To; Paul F. R. Wilson; Viet Nguyen; Mohamed Harmanani; Michael Cooper; Fahimeh Fooladgar; Purang Abolmaesumi; Parvin Mousavi; Rahul G. Krishnan

TL;DR

This paper tackles subpopulation shift by introducing Diverse Prototypical Ensembles (DPE), a two-stage method that keeps a fixed feature extractor and replaces the classifier with a diversified, distance-based prototypical ensemble. By jointly training multiple prototypes per class with explicit inter-prototype similarity regularization and bootstrap diversification, DPE discovers and covers diverse subpopulations without requiring subgroup annotations. Empirical results across nine real-world benchmarks show that DPE improves worst-group accuracy, often surpassing state-of-the-art methods, while maintaining competitive standard accuracy. The approach offers a scalable, annotation-free path to fairness in deployment settings where subgroup labels are unavailable or costly to obtain.

Abstract

Subpopulation shift, characterized by a disparity in subpopulation distribution between the training and target datasets, can significantly degrade the performance of machine learning models. Current solutions to subpopulation shift involve modifying empirical risk minimization with re-weighting strategies to improve generalization. This strategy relies on assumptions about the number and nature of subpopulations and annotations on group membership, which are unavailable for many real-world datasets. Instead, we propose using an ensemble of diverse classifiers to adaptively capture risk associated with subpopulations. Given a feature extractor network, we replace its standard linear classification layer with a mixture of prototypical classifiers, where each member is trained to classify the data while focusing on different features and samples from other members. In empirical evaluation on nine real-world datasets, covering diverse domains and kinds of subpopulation shift, our method of Diverse Prototypical Ensembles (DPEs) often outperforms the prior state-of-the-art in worst-group accuracy. The code is available at https://github.com/minhto2802/dpe4subpop
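
To make the idea of a distance-based prototypical ensemble concrete, the following is a minimal NumPy sketch, not the authors' implementation. It assumes features come from a frozen extractor, that each ensemble member holds one prototype per class, that logits are negative squared Euclidean distances, that member predictions are combined by averaging softmax probabilities, and that diversity is measured as mean pairwise cosine similarity between same-class prototypes. All function names, shapes, and these specific choices are illustrative assumptions; the released code linked above may differ.

```python
import numpy as np


def prototype_logits(features, prototypes):
    """Negative squared Euclidean distance to each class prototype.

    features:   (n, d) embeddings from a frozen feature extractor (assumed)
    prototypes: (c, d) one prototype per class for a single ensemble member
    returns:    (n, c) logits; a closer prototype gives a larger logit
    """
    diff = features[:, None, :] - prototypes[None, :, :]
    return -np.sum(diff ** 2, axis=-1)


def softmax(logits):
    """Numerically stable softmax over the last axis."""
    shifted = logits - logits.max(axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=-1, keepdims=True)


def ensemble_predict(features, ensemble):
    """Average class probabilities over members, then take the argmax.

    ensemble: (m, c, d) array holding m members (hypothetical layout)
    returns:  (predicted labels of shape (n,), probabilities of shape (n, c))
    """
    member_probs = [softmax(prototype_logits(features, p)) for p in ensemble]
    probs = np.mean(member_probs, axis=0)
    return probs.argmax(axis=-1), probs


def diversity_penalty(ensemble):
    """Mean cosine similarity between same-class prototypes of distinct members.

    This is an assumed stand-in for an inter-prototype similarity
    regularizer: lower values mean more diverse members. Requires m >= 2
    and non-zero prototypes.
    """
    m = ensemble.shape[0]
    unit = ensemble / np.linalg.norm(ensemble, axis=-1, keepdims=True)
    # sim[i, j, c] = cosine similarity of the class-c prototypes of members i and j
    sim = np.einsum("icd,jcd->ijc", unit, unit)
    off_diagonal = ~np.eye(m, dtype=bool)
    return sim[off_diagonal].mean()
```

In a training loop, a penalty of this kind would be added to the classification loss so that members are pushed toward different regions of feature space; the weighting between the two terms is a hyperparameter not specified here.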
