Examining the Utility of Self-disclosure Types for Modeling Annotators of Social Norms
Kieran Henderson, Kian Omoomi, Vasudha Varadarajan, Allison Lahnala, Charles Welch
TL;DR
This work investigates how self-disclosure types about annotators influence the modeling of subjective judgments in social norms tasks. It combines a theory-based taxonomy (derived from Social Penetration Theory) with automatic clustering to predict annotator verdicts on Reddit AITA data, showing that demographic information is the most predictive self-disclosure type and that a diverse, small set of annotator disclosures yields the best performance. Contrary to prior work, a few highly similar posts outperform using many posts, and theory-based categorizations generally outperform automatic clusters. The findings offer practical guidance for perspectivist NLP and demonstrate the value of richer, structured identity representations in modeling human subjectivity for NLP tasks.
Abstract
Recent work has explored the use of personal information in the form of persona sentences or self-disclosures to improve modeling of individual characteristics and prediction of annotator labels for subjective tasks. The volume of personal information has historically been restricted and thus little exploration has gone into understanding what kind of information is most informative for predicting annotator labels. In this work, we categorize self-disclosure sentences and use them to build annotator models for predicting judgments of social norms. We perform several ablations and analyses to examine the impact of the type of information on our ability to predict annotation patterns. We find that demographics are more impactful than attitudes, relationships, and experiences. Generally, theory-based approaches worked better than automatic clusters. Contrary to previous work, only a small number of related comments are needed. Lastly, having a more diverse sample of annotator self-disclosures leads to the best performance.
