Addressing Bias Through Ensemble Learning and Regularized Fine-Tuning

Ahmed Radwan; Layan Zaafarani; Jetana Abudawood; Faisal AlZahrani; Fares Fourati

Addressing Bias Through Ensemble Learning and Regularized Fine-Tuning

Ahmed Radwan, Layan Zaafarani, Jetana Abudawood, Faisal AlZahrani, Fares Fourati

TL;DR

The paper tackles bias in AI models when labeled data are scarce and pretrained models may encode biases. It introduces a pipeline that creates multiple models from a biased pretrained model via data-splitting and regularized fine-tuning, ensembles their predictions with logit summation to reduce bias, and then distills the ensemble into a single unbiased network for efficient inference. The core objective during regularized fine-tuning is $ \min _{\\theta} \\mathbb{E}_{(x, y) \\sim \\mathcal{D}} [ \\mathcal{L}(\\theta, x, y) + \\lambda \\|\\theta - \\theta^*\\|^2 ] $, optionally with $ \\beta \\|\\theta\\|^2 $; experiments on CIFAR-10 and HAM10000 using DenseNet-121 show improved performance for underrepresented classes and strong overall accuracy via ensembles, though distillation with logits-based loss can be challenging. The work provides a practical framework for unbiased deployment under data constraints and highlights the trade-offs between regularization strength, ensemble gains, and distillation feasibility.

Abstract

Addressing biases in AI models is crucial for ensuring fair and accurate predictions. However, obtaining large, unbiased datasets for training can be challenging. This paper proposes a comprehensive approach using multiple methods to remove bias in AI models, with only a small dataset and a potentially biased pretrained model. We train multiple models with the counter-bias of the pre-trained model through data splitting, local training, and regularized fine-tuning, gaining potentially counter-biased models. Then, we employ ensemble learning for all models to reach unbiased predictions. To further accelerate the inference time of our ensemble model, we conclude our solution with knowledge distillation that results in a single unbiased neural network. We demonstrate the effectiveness of our approach through experiments on the CIFAR10 and HAM10000 datasets, showcasing promising results. This work contributes to the ongoing effort to create more unbiased and reliable AI models, even with limited data availability.

Addressing Bias Through Ensemble Learning and Regularized Fine-Tuning

TL;DR

Abstract

Addressing Bias Through Ensemble Learning and Regularized Fine-Tuning

Authors

TL;DR

Abstract

Table of Contents

Figures (2)