Getting in the Door: Streamlining Intake in Civil Legal Services with Large Language Models
Quinten Steenhuis, Hannes Westermann
TL;DR
This paper tackles the bottleneck of civil legal aid intake by integrating formal eligibility rules with large language models in a Docassemble-based housing intake platform. It evaluates eight LLMs, finding that GPT-4-turbo achieves the best overall F1 score around 0.82, with a strong performance in jurisdictions that have more detailed rules. The study demonstrates that LLMs can safely pre-screen applicants, eliciting missing information through follow-ups while reducing false negatives, though errors mostly pertain to follow-up planning and rule interpretation. The work highlights practical implications for accelerating access to justice and lays out future enhancements, including deeper integration, analytics, and cross-provider eligibility matching to further expand reach.
Abstract
Legal intake, the process of finding out if an applicant is eligible for help from a free legal aid program, takes significant time and resources. In part this is because eligibility criteria are nuanced, open-textured, and require frequent revision as grants start and end. In this paper, we investigate the use of large language models (LLMs) to reduce this burden. We describe a digital intake platform that combines logical rules with LLMs to offer eligibility recommendations, and we evaluate the ability of 8 different LLMs to perform this task. We find promising results for this approach to help close the access to justice gap, with the best model reaching an F1 score of .82, while minimizing false negatives.
