Automating Transparency Mechanisms in the Judicial System Using LLMs: Opportunities and Challenges
Ishana Shastri, Shomik Jain, Barbara Engelhardt, Ashia Wilson
TL;DR
The work addresses the challenge of making judicial processes more transparent by using large language models to extract structured information from unstructured court documents. It analyzes two case studies, jury selection in criminal trials and eviction proceedings, to evaluate LLM capabilities in information extraction, while highlighting limitations in accuracy and in handling tasks that require inference. The results show heterogeneous performance (e.g., 81.6% for selected juror names, 3.6% for gender composition, 95.8% for zip code), underscoring the need for targeted technical and legal investments, standardized data, and human oversight. The study provides a roadmap for improving data accessibility, pre-processing, and evaluation benchmarks to responsibly scale automated transparency in the judiciary and reduce potential disparities.
Abstract
Making the judicial system more transparent in order to increase accountability often demands extensive effort from auditors, who must meticulously sift through numerous disorganized legal case files to detect patterns of bias and error. For example, the high-profile investigation into the Curtis Flowers case took seven reporters a full year to assemble evidence about the prosecutor's history of selecting racially biased juries. LLMs have the potential to automate and scale these transparency pipelines, especially given their demonstrated capabilities to extract information from unstructured documents. We discuss the opportunities and challenges of using LLMs to provide transparency in two important court processes: jury selection in criminal trials and housing eviction cases.
