Spectral Clustering for Crowdsourcing with Inherently Distinct Task Types
Saptarshi Mandal, Seo Taek Kong, Dimitrios Katselis, R. Srikant
TL;DR
The paper addresses crowdsourcing with tasks of inherently distinct types (easy vs hard) by extending the Dawid-Skene framework to a two-type model. It introduces a spectral clustering method to partition tasks by type, achieving perfect clustering when the number of workers scales as $n = \Theta(\log d)$, enabling per-type application of DS-based label estimation (TE for reliabilities and NP-WMV for labels). The authors provide rigorous concentration and perturbation analyses, including a novel use of low-rank plus sparse structures and eigenvector perturbation results, to guarantee accurate clustering and fast-decaying labeling error. Empirical evaluations on real and pseudo-real datasets show that clustering by task type before label estimation improves performance in most scenarios, validating the practical value of the proposed two-step approach.
Abstract
The Dawid-Skene model is the most widely assumed model in the analysis of crowdsourcing algorithms that estimate ground-truth labels from noisy worker responses. In this work, we are motivated by crowdsourcing applications where workers have distinct skill sets and their accuracy additionally depends on a task's type. While weighted majority vote (WMV) with a single weight vector for each worker achieves the optimal label estimation error in the Dawid-Skene model, we show that different weights for different types are necessary for a multi-type model. Focusing on the case where there are two types of tasks, we propose a spectral method to partition tasks into two groups that cluster tasks by type. Our analysis reveals that task types can be perfectly recovered if the number of workers $n$ scales logarithmically with the number of tasks $d$. Any algorithm designed for the Dawid-Skene model can then be applied independently to each type to infer the labels. Numerical experiments show how clustering tasks by type before estimating ground-truth labels enhances the performance of crowdsourcing algorithms in practical applications.
