Analyzing Bias in Swiss Federal Supreme Court Judgments Using Facebook's Holistic Bias Dataset: Implications for Language Model Training

Sabine Wehnert; Muhammet Ertas; Ernesto William De Luca

Analyzing Bias in Swiss Federal Supreme Court Judgments Using Facebook's Holistic Bias Dataset: Implications for Language Model Training

Sabine Wehnert, Muhammet Ertas, Ernesto William De Luca

TL;DR

Problem: biases in the Swiss Judgement Prediction Dataset may propagate to NLP judgments. Approach: leverage Holistic Bias dispreferred descriptors, multilingual expansion to German/French/Italian, extractive summarization and chunking to respect a $512$-token limit, fine-tune a legal-domain RoBERTa-large, and assess with a Binomial Significance Test and attention visualization. Contributions: (a) descriptor-based bias analysis in the SJP dataset across languages; (b) evidence that certain descriptors (e.g., 'victime', 'Opfer') correlate with biased predictions or translation artifacts; (c) demonstration of attention-attribution patterns and limitations of chunking and translation; (d) practical considerations for bias-aware training in multilingual legal NLP. Significance: informs data curation and model training to mitigate bias in legal NLP applications.

Abstract

Natural Language Processing (NLP) is vital for computers to process and respond accurately to human language. However, biases in training data can introduce unfairness, especially in predicting legal judgment. This study focuses on analyzing biases within the Swiss Judgment Prediction Dataset (SJP-Dataset). Our aim is to ensure unbiased factual descriptions essential for fair decision making by NLP models in legal contexts. We analyze the dataset using social bias descriptors from the Holistic Bias dataset and employ advanced NLP techniques, including attention visualization, to explore the impact of dispreferred descriptors on model predictions. The study identifies biases and examines their influence on model behavior. Challenges include dataset imbalance and token limits affecting model performance.

Analyzing Bias in Swiss Federal Supreme Court Judgments Using Facebook's Holistic Bias Dataset: Implications for Language Model Training

TL;DR

-token limit, fine-tune a legal-domain RoBERTa-large, and assess with a Binomial Significance Test and attention visualization. Contributions: (a) descriptor-based bias analysis in the SJP dataset across languages; (b) evidence that certain descriptors (e.g., 'victime', 'Opfer') correlate with biased predictions or translation artifacts; (c) demonstration of attention-attribution patterns and limitations of chunking and translation; (d) practical considerations for bias-aware training in multilingual legal NLP. Significance: informs data curation and model training to mitigate bias in legal NLP applications.

Abstract

Paper Structure (20 sections, 3 equations, 6 figures, 7 tables)

This paper contains 20 sections, 3 equations, 6 figures, 7 tables.

Introduction
Related Work
Bias Analysis in Swiss Federal Court Judgments
Selecting Bias-Descriptors
Dataset Translation
Preprocessing
Extractive Summarization:
Chunking:
Model Fine-Tuning
Dataset Analysis with the Binomial Significance Test
Overview of the Dataset and Null Hypothesis
Testing the "Dismissal" Outcome for a Specific Token
Significance Threshold
Testing Both Outcomes
Analysis of Language Model Performance
...and 5 more sections

Figures (6)

Figure 1: Holistic Bias Dataset - Version 1.1 with respectively “dispreferred” labeled descriptors.
Figure 2: Occurrences of Derived Descriptors in German Training Data, by Instance Label.
Figure 3: Results of the binomial significance test for chunked data.
Figure 4: Classification Performance on German Test Data per Contained Descriptor.
Figure 5: Classification Performance on French Test Data per Contained Descriptor.
...and 1 more figures

Analyzing Bias in Swiss Federal Supreme Court Judgments Using Facebook's Holistic Bias Dataset: Implications for Language Model Training

TL;DR

Abstract

Analyzing Bias in Swiss Federal Supreme Court Judgments Using Facebook's Holistic Bias Dataset: Implications for Language Model Training

Authors

TL;DR

Abstract

Table of Contents

Figures (6)