Little Data, Big Impact: Privacy-Aware Visual Language Models via Minimal Tuning
Laurens Samson, Nimrod Barazani, Sennay Ghebreab, Yuki M. Asano
TL;DR
This work investigates privacy-awareness in Visual Language Models (VLMs) and identifies major gaps in recognizing privacy-sensitive content. It introduces two GDPR-aligned privacy benchmarks, PrivBench and PrivBench-H, and a privacy-tuning dataset, PrivTune, to enable targeted instruction tuning. The authors demonstrate that privacy-tuning on as few as 100 samples yields substantial improvements across privacy benchmarks—surpassing strong baselines like GPT-4 in several settings—while incurring only modest impact on non-privacy tasks. They also expose label noise in existing privacy datasets and show strong generalization to unseen privacy categories and multilingual contexts. Finally, they illustrate practical applicability by analyzing privacy across large image collections (e.g., Places365), highlighting the potential for scalable privacy screening in real-world deployments.
Abstract
As Visual Language Models (VLMs) become increasingly embedded in everyday applications, ensuring they can recognize and appropriately handle privacy-sensitive content is essential. We conduct a comprehensive evaluation of ten state-of-the-art VLMs and identify limitations in their understanding of visual privacy. Existing datasets suffer from label inconsistencies, limiting their reliability. To address this, we introduce two compact, high-quality benchmarks, PrivBench and PrivBench-H, that focus on commonly recognized privacy categories aligned with the General Data Protection Regulation (GDPR). Additionally, we present PrivTune, an instruction-tuning dataset specifically curated to improve privacy sensitivity. We obtain a Privacy VLM by fine-tuning an off-the-shelf VLM on only 100 samples from PrivTune, which leads to substantial gains on all benchmarks, surpassing GPT-4, while maintaining strong performance on other tasks. Our findings show that privacy-awareness in VLMs can be substantially improved with minimal data and careful dataset design, setting the stage for safer, more privacy-aligned AI systems.
