Table of Contents
Fetching ...

Learning for Detecting Norm Violation in Online Communities

Thiago Freitas dos Santos, Nardine Osman, Marco Schorlemmer

TL;DR

This paper tackles the challenge of interpreting and enforcing online community norms by proposing a normative violation detection framework (FNVD) that not only detects violations but also explains to violators which features of their actions triggered the violation. FNVD combines Logistic Model Trees for interpretable probabilistic classification with K-Means clustering to identify the most influential action features, guided by a domain-specific taxonomy for explanation. The authors apply FNVD to Wikipedia vandalism detection, using a feature-rich taxonomy over 61 features derived from edit data and evaluated with 10-fold cross-validation, achieving an overall accuracy of $96\%$ and providing actionable feedback to violators. The work demonstrates a path toward adaptive, explainable norm enforcement in online communities, with online retraining and feedback mechanisms proposed for handling evolving norms and broader domains.

Abstract

In this paper, we focus on normative systems for online communities. The paper addresses the issue that arises when different community members interpret these norms in different ways, possibly leading to unexpected behavior in interactions, usually with norm violations that affect the individual and community experiences. To address this issue, we propose a framework capable of detecting norm violations and providing the violator with information about the features of their action that makes this action violate a norm. We build our framework using Machine Learning, with Logistic Model Trees as the classification algorithm. Since norm violations can be highly contextual, we train our model using data from the Wikipedia online community, namely data on Wikipedia edits. Our work is then evaluated with the Wikipedia use case where we focus on the norm that prohibits vandalism in Wikipedia edits.

Learning for Detecting Norm Violation in Online Communities

TL;DR

This paper tackles the challenge of interpreting and enforcing online community norms by proposing a normative violation detection framework (FNVD) that not only detects violations but also explains to violators which features of their actions triggered the violation. FNVD combines Logistic Model Trees for interpretable probabilistic classification with K-Means clustering to identify the most influential action features, guided by a domain-specific taxonomy for explanation. The authors apply FNVD to Wikipedia vandalism detection, using a feature-rich taxonomy over 61 features derived from edit data and evaluated with 10-fold cross-validation, achieving an overall accuracy of and providing actionable feedback to violators. The work demonstrates a path toward adaptive, explainable norm enforcement in online communities, with online retraining and feedback mechanisms proposed for handling evolving norms and broader domains.

Abstract

In this paper, we focus on normative systems for online communities. The paper addresses the issue that arises when different community members interpret these norms in different ways, possibly leading to unexpected behavior in interactions, usually with norm violations that affect the individual and community experiences. To address this issue, we propose a framework capable of detecting norm violations and providing the violator with information about the features of their action that makes this action violate a norm. We build our framework using Machine Learning, with Logistic Model Trees as the classification algorithm. Since norm violations can be highly contextual, we train our model using data from the Wikipedia online community, namely data on Wikipedia edits. Our work is then evaluated with the Wikipedia use case where we focus on the norm that prohibits vandalism in Wikipedia edits.

Paper Structure

This paper contains 15 sections, 2 equations, 6 figures, 2 tables.

Figures (6)

  • Figure 1: An example of a tree built by the LMT algorithmlandwehr2005logistic. X$_1$ and X$_2$ are features present in the dataset. F$_1$ and F$_2$ are the equations found by the logistic regression model, describing the weights for each feature present in the training dataset.
  • Figure 2: How the framework works when deployed in an online community.
  • Figure 3: Taxonomy associated with Wikipedia's 'no vandalism' norm.
  • Figure 4: Taxonomy for part of the features that were most relevant for the vandalism classification. These features are then presented to the user with a descriptive text.
  • Figure 5: The built model for the vandalism detection, using Logistic Model Tree.
  • ...and 1 more figures