Table of Contents
Fetching ...

Enhancing Web Spam Detection through a Blockchain-Enabled Crowdsourcing Mechanism

Noah Kader, Inwon Kang, Oshani Seneviratne

TL;DR

Simulations show that incentivized crowdsourcing improves data quality, leading to more effective machine-learning models for spam detection, and offers a scalable and adaptable solution to the challenges of traditional methods.

Abstract

The proliferation of spam on the Web has necessitated the development of machine learning models to automate their detection. However, the dynamic nature of spam and the sophisticated evasion techniques employed by spammers often lead to low accuracy in these models. Traditional machine-learning approaches struggle to keep pace with spammers' constantly evolving tactics, resulting in a persistent challenge to maintain high detection rates. To address this, we propose blockchain-enabled incentivized crowdsourcing as a novel solution to enhance spam detection systems. We create an incentive mechanism for data collection and labeling by leveraging blockchain's decentralized and transparent framework. Contributors are rewarded for accurate labels and penalized for inaccuracies, ensuring high-quality data. A smart contract governs the submission and evaluation process, with participants staking cryptocurrency as collateral to guarantee integrity. Simulations show that incentivized crowdsourcing improves data quality, leading to more effective machine-learning models for spam detection. This approach offers a scalable and adaptable solution to the challenges of traditional methods.

Enhancing Web Spam Detection through a Blockchain-Enabled Crowdsourcing Mechanism

TL;DR

Simulations show that incentivized crowdsourcing improves data quality, leading to more effective machine-learning models for spam detection, and offers a scalable and adaptable solution to the challenges of traditional methods.

Abstract

The proliferation of spam on the Web has necessitated the development of machine learning models to automate their detection. However, the dynamic nature of spam and the sophisticated evasion techniques employed by spammers often lead to low accuracy in these models. Traditional machine-learning approaches struggle to keep pace with spammers' constantly evolving tactics, resulting in a persistent challenge to maintain high detection rates. To address this, we propose blockchain-enabled incentivized crowdsourcing as a novel solution to enhance spam detection systems. We create an incentive mechanism for data collection and labeling by leveraging blockchain's decentralized and transparent framework. Contributors are rewarded for accurate labels and penalized for inaccuracies, ensuring high-quality data. A smart contract governs the submission and evaluation process, with participants staking cryptocurrency as collateral to guarantee integrity. Simulations show that incentivized crowdsourcing improves data quality, leading to more effective machine-learning models for spam detection. This approach offers a scalable and adaptable solution to the challenges of traditional methods.
Paper Structure (22 sections, 1 equation, 5 figures, 1 algorithm)

This paper contains 22 sections, 1 equation, 5 figures, 1 algorithm.

Figures (5)

  • Figure 1: System Overview
  • Figure 2: Incentive Mechanism Codified in the Smart Contract
  • Figure 3: Feature Analysis
  • Figure 4: Model Evaluation With Increasing Dataset Sizes
  • Figure 5: Incentive Mechanism Simulation Results