The impact of abstract and object tags on image privacy classification
Darya Baranouskaya, Andrea Cavallaro
TL;DR
The paper investigates whether abstract or concrete tag types are more informative for image privacy classification. Using ClarifAI-generated tags and a controlled feature-selection pipeline, it compares abstract, concrete, and combined tag representations across three privacy datasets, varying the number of tags per image. Key findings show abstract tags excel when tag budgets are small and subjectivity is high, while larger tag budgets reduce the advantage of abstract information, allowing concrete or mixed tags to perform comparably. The analysis also reveals limited direct co-occurrence between abstract and concrete tags, suggesting that combined tag sets capture complementary cues when abundant tagging is available. Practically, the work guides the design of interpretable privacy classifiers by incorporating abstract concepts, especially for subjective tasks, while demonstrating budget-driven trade-offs between tag types.
Abstract
Object tags denote concrete entities and are central to many computer vision tasks, whereas abstract tags capture higher-level information, which is relevant for tasks that require a contextual, potentially subjective scene understanding. Object and abstract tags extracted from images also facilitate interpretability. In this paper, we explore which type of tags is more suitable for the context-dependent and inherently subjective task of image privacy. While object tags are generally used for privacy classification, we show that abstract tags are more effective when the tag budget is limited. Conversely, when a larger number of tags per image is available, object-related information is as useful. We believe that these findings will guide future research in developing more accurate image privacy classifiers, informed by the role of tag types and quantity.
