Beyond Bias Scores: Unmasking Vacuous Neutrality in Small Language Models
Sumanth Manduru, Carlotta Domeniconi
TL;DR
The paper addresses the fairness of small language models (0.5B–5B) by introducing VaNeu, a four-stage framework—Bias, Utility, Ambiguity Handling, and Positional Bias—for pre-deployment evaluation. It conducts a large-scale audit across nine open-source SLMs from four families using BBQ, StereoSet, and CrowS-Pairs, revealing that models with low bias can still perform poorly under ambiguity or display biased response patterns due to positional heuristics. Key findings show that the Phi family often achieves robust Utility and Ambiguity Handling with minimal Positional Bias, while other families exhibit vacuous neutrality—apparent fairness paired with unreliable reasoning. The work argues for multidimensional fairness assessment prior to deployment and highlights directions for formalizing Vacuous Neutrality and developing a composite fairness metric with practical implications for responsible use of SLMs in sensitive settings.
Abstract
The rapid adoption of Small Language Models (SLMs) for resource constrained applications has outpaced our understanding of their ethical and fairness implications. To address this gap, we introduce the Vacuous Neutrality Framework (VaNeu), a multi-dimensional evaluation paradigm designed to assess SLM fairness prior to deployment. The framework examines model robustness across four stages - biases, utility, ambiguity handling, and positional bias over diverse social bias categories. To the best of our knowledge, this work presents the first large-scale audit of SLMs in the 0.5-5B parameter range, an overlooked "middle tier" between BERT-class encoders and flagship LLMs. We evaluate nine widely used SLMs spanning four model families under both ambiguous and disambiguated contexts. Our findings show that models demonstrating low bias in early stages often fail subsequent evaluations, revealing hidden vulnerabilities and unreliable reasoning. These results underscore the need for a more comprehensive understanding of fairness and reliability in SLMs, and position the proposed framework as a principled tool for responsible deployment in socially sensitive settings.
