PersonaMatrix: A Recipe for Persona-Aware Evaluation of Legal Summarization
Tsz Fung Pang, Maryam Berijanian, Thomas Orth, Breanna Shi, Charlotte S. Alexander
TL;DR
PersonaMatrix tackles the need for stakeholder-sensitive evaluation in legal summarization by introducing a persona-by-criterion framework and a controlled dimension-shifted dataset. It combines persona-conditioned evaluators with a new Diversity-Coverage Index (DCI) to quantify between-persona divergence and within-persona coherence. The approach uses an Extractor→Rewriter→Validator pipeline and LLM-driven rubrics to generate and assess variants across three conflicting quality dimensions. Results show statistically significant divergences between persona-aware and generic judges, with interior optima suggesting multi-objective trade-offs. This work provides a scalable, practitioner-friendly path toward more accessible and useful legal AI summaries, with code and data publicly available.
Abstract
Legal documents are often long, dense, and difficult to comprehend, not only for laypeople but also for legal experts. While automated document summarization has great potential to improve access to legal knowledge, prevailing task-based evaluators overlook divergent user and stakeholder needs. Tool development is needed to encompass the technicality of a case summary for a litigator yet be accessible for a self-help public researching for their lawsuit. We introduce PersonaMatrix, a persona-by-criterion evaluation framework that scores summaries through the lens of six personas, including legal and non-legal users. We also introduce a controlled dimension-shifted pilot dataset of U.S. civil rights case summaries that varies along depth, accessibility, and procedural detail as well as Diversity-Coverage Index (DCI) to expose divergent optima of legal summary between persona-aware and persona-agnostic judges. This work enables refinement of legal AI summarization systems for both expert and non-expert users, with the potential to increase access to legal knowledge. The code base and data are publicly available in GitHub.
