"Define Your Terms" : Enhancing Efficient Offensive Speech Classification with Definition
Huy Nghiem, Umang Gupta, Fred Morstatter
TL;DR
Problem: cross-domain offensive speech detection with limited labeled data faces semantic diversity and domain shift. Method: JE_ProtoNet, a joint embedding Prototypical Network that uses label definitions via attention to align inputs with label semantics, trained with meta-learning across 10 domains and evaluated on 4 held-out domains. Contributions: compilation of 14 datasets (~82k training samples across 10 domains), demonstration that label-definition-aware joint embeddings yield robust few-shot performance (achieving at least $75\%$ of the maximal $F1$-score with $<10\%$ of data), and extensive ablations showing the benefit of definition-aware signaling. Significance: enhances data efficiency for offensive-speech detection and provides a framework likely applicable to other label-rich NLP tasks such as sentiment or stance detection.
Abstract
The propagation of offensive content through social media channels has garnered attention of the research community. Multiple works have proposed various semantically related yet subtle distinct categories of offensive speech. In this work, we explore meta-earning approaches to leverage the diversity of offensive speech corpora to enhance their reliable and efficient detection. We propose a joint embedding architecture that incorporates the input's label and definition for classification via Prototypical Network. Our model achieves at least 75% of the maximal F1-score while using less than 10% of the available training data across 4 datasets. Our experimental findings also provide a case study of training strategies valuable to combat resource scarcity.
