Automating Governing Knowledge Commons and Contextual Integrity (GKC-CI) Privacy Policy Annotations with Large Language Models
Jake Chanenson, Madison Pickering, Noah Apthorpe
TL;DR
This work demonstrates that high-accuracy GKC-CI parameter annotations of privacy policies can be automated with fine-tuned large language models, achieving $90.65\%$ exact-match accuracy on a ground-truth set and enabling large-scale longitudinal and cross-industry analyses. Using LoRA-based PEFT across 50 models and a carefully designed sentence-level prompting regime, the authors show that open-source models struggle without fine-tuning, while a GPT-3.5 Turbo variant trained for 25 epochs attains strong performance and cost efficiency. The approach yields scalable insights into policy evolution, parameter-type variance, and density, and is complemented by a visualization tool and freely available data, code, and annotations to support future GKC-CI research. The work also highlights practical considerations, such as model alignment, library defaults, context-window limitations, and the potential for extending to non-policy documents, setting a path for normative privacy analysis at scale.
Abstract
Identifying contextual integrity (CI) and governing knowledge commons (GKC) parameters in privacy policy texts can facilitate normative privacy analysis. However, GKC-CI annotation has heretofore required manual or crowdsourced effort. This paper demonstrates that high-accuracy GKC-CI parameter annotation of privacy policies can be performed automatically using large language models. We fine-tune 50 open-source and proprietary models on 21,588 ground truth GKC-CI annotations from 16 privacy policies. Our best performing model has an accuracy of 90.65%, which is comparable to the accuracy of experts on the same task. We apply our best performing model to 456 privacy policies from a variety of online services, demonstrating the effectiveness of scaling GKC-CI annotation for privacy policy exploration and analysis. We publicly release our model training code, training and testing data, an annotation visualizer, and all annotated policies for future GKC-CI research.
