Few-Shot Fairness: Unveiling LLM's Potential for Fairness-Aware Classification
Garima Chhikara, Anurag Sharma, Kripabandhu Ghosh, Abhijnan Chakraborty
TL;DR
The paper addresses whether large language models can produce fairness-aware classifications when guided by prompt-based in-context learning. It introduces a fairness framework with abstract (π_A) and detailed (π_D) rules, and evaluates zero-shot and few-shot settings using Retrieval-Augmented Generation to select demonstrations on the UCI Adult dataset across GPT-4, LLaMA-2, and Gemini. GPT-4 consistently achieves the best balance of accuracy and fairness across multiple definitions, though metrics like Disparate Impact, TPR, and FPR reveal persisting group biases. Generic Fairness prompts perform comparably to abstract definitions, highlighting the potential of simple, broad fairness signals to improve outputs, while also underscoring the need for broader datasets and models to reduce remaining biases before deployment.
Abstract
Employing Large Language Models (LLM) in various downstream applications such as classification is crucial, especially for smaller companies lacking the expertise and resources required for fine-tuning a model. Fairness in LLMs helps ensure inclusivity, equal representation based on factors such as race, gender and promotes responsible AI deployment. As the use of LLMs has become increasingly prevalent, it is essential to assess whether LLMs can generate fair outcomes when subjected to considerations of fairness. In this study, we introduce a framework outlining fairness regulations aligned with various fairness definitions, with each definition being modulated by varying degrees of abstraction. We explore the configuration for in-context learning and the procedure for selecting in-context demonstrations using RAG, while incorporating fairness rules into the process. Experiments conducted with different LLMs indicate that GPT-4 delivers superior results in terms of both accuracy and fairness compared to other models. This work is one of the early attempts to achieve fairness in prediction tasks by utilizing LLMs through in-context learning.
