Discovering the Hidden Role of Gini Index In Prompt-based Classification

Ruixi Lin

Abstract

In classification tasks, long-tailed minority classes often carry the most important predictions. Yet these classes consistently show low accuracy, while a few high-performing classes dominate. We seek a foundational understanding of the hidden role of the Gini index as a tool for detecting and optimizing (debiasing) disparities in class accuracy, focusing on prompt-based classification. We introduce the intuitions, benchmark Gini scores in real-world LLMs and vision models, and discuss the Gini index both as a measure of relative accuracy dominance and as a direct optimization metric. Through rigorous case analyses, we first show that relative accuracy imbalance, ranging from weak to strong, exists in prompt-based text and image classification alike, regardless of whether the classification is high-dimensional or low-dimensional. We then use the Gini metric to propose a post-hoc, model-agnostic bias mitigation method. Experimental results across few-shot news, biomedical, and zero-shot image classification show that our method significantly reduces both relative and absolute accuracy imbalances, minimizing the relative dominance of the top classes while elevating the weakest classes.
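To make the idea of a Gini index over class accuracies concrete, the sketch below computes the textbook Gini coefficient (mean absolute difference normalized by twice the mean) on a list of per-class accuracies. This is only an illustration under the assumption that the textbook form applies; the paper's own definition appears in its "Gini Index Definition" section and may differ in normalization. The function name `gini_index` and the sample accuracies are hypothetical.

```python
from typing import Sequence


def gini_index(accuracies: Sequence[float]) -> float:
    """Textbook Gini coefficient of per-class accuracies.

    Assumption: G = sum_i sum_j |a_i - a_j| / (2 * n * sum_i a_i).
    0 means all classes are equally accurate; values near 1 mean
    a few classes hold almost all of the accuracy mass.
    """
    n = len(accuracies)
    if n == 0:
        raise ValueError("need at least one class accuracy")
    total = sum(accuracies)
    if total == 0:
        # All classes have zero accuracy: no relative disparity to measure.
        return 0.0
    # Sum of absolute differences over all ordered pairs of classes.
    abs_diff_sum = sum(abs(a - b) for a in accuracies for b in accuracies)
    return abs_diff_sum / (2 * n * total)


# Hypothetical per-class accuracies, not taken from the paper.
balanced = [0.8, 0.8, 0.8]
dominated = [1.0, 0.0, 0.0, 0.0]
print(gini_index(balanced))
print(gini_index(dominated))
```

In this form a perfectly balanced classifier scores 0, while a classifier whose accuracy is concentrated in a single class out of n scores (n - 1) / n, which is why the index captures relative dominance rather than absolute accuracy levels.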

Paper Structure (21 sections, 10 equations, 1 figure, 8 tables)

Introduction
Related Work
The Inequality Measure: Gini Index
Prompt-based Classification
Measuring Class Accuracy Disparities
The Gini Index for Relative Class Accuracy Disparity in Prompt-Based Classification
Gini Index Definition
A Numerical Walkthrough of the Gini Index Calculation
Comparisons Between Gini Index And COBias
Benchmarking Gini Index of Class Accuracies in Prompt-based Classification Tasks
Case Analysis 1: Text Classification
Case Analysis 2: Image Classification
The Rationale
The Bias Mitigation Method
Optimization Experiments
...and 6 more sections

Figures (1)

Figure 1: Measurements of Gini index and related metrics for image classification (CIFAR-100; 100 classes)
