LegalGuardian: A Privacy-Preserving Framework for Secure Integration of Large Language Models in Legal Practice
M. Mikail Demir, Hakan T. Otal, M. Abdullah Canbaz
TL;DR
The paper tackles confidentiality risks when deploying Large Language Models in legal practice by introducing LegalGuardian, a lightweight privacy-preserving framework that masks PII in prompts using Named Entity Recognition and local LLMs, enabling secure interaction with external AI tools. It combines synthetic data generation for immigration-law scenarios with a four-layer pipeline (data generation, PII masking, secure prompting, evaluation) and evaluates two masking strategies (GLiNER and Qwen2.5-14B) on 460 entities. Results show complementary strengths: GLiNER offers high precision and structured-entity accuracy, while Qwen2.5-14B provides higher recall and broader entity detection, with semantic fidelity largely preserved. The work demonstrates a practical path for lawyers to leverage AI while upholding client confidentiality, suggesting a hybrid approach and outlining future extensions to broader legal domains and privacy-preserving cloud integrations.
Abstract
Large Language Models (LLMs) hold promise for advancing legal practice by automating complex tasks and improving access to justice. However, their adoption is limited by concerns over client confidentiality, especially when lawyers include sensitive Personally Identifiable Information (PII) in prompts, risking unauthorized data exposure. To mitigate this, we introduce LegalGuardian, a lightweight, privacy-preserving framework tailored for lawyers using LLM-based tools. LegalGuardian employs Named Entity Recognition (NER) techniques and local LLMs to mask and unmask confidential PII within prompts, safeguarding sensitive data before any external interaction. We detail its development and assess its effectiveness using a synthetic prompt library in immigration law scenarios. Comparing traditional NER models with one-shot prompted local LLM, we find that LegalGuardian achieves a F1-score of 93% with GLiNER and 97% with Qwen2.5-14B in PII detection. Semantic similarity analysis confirms that the framework maintains high fidelity in outputs, ensuring robust utility of LLM-based tools. Our findings indicate that legal professionals can harness advanced AI technologies without compromising client confidentiality or the quality of legal documents.
