Lightweight Contenders: Navigating Semi-Supervised Text Mining through Peer Collaboration and Self Transcendence
Qianren Mao, Weifeng Jiang, Junnan Liu, Chenghua Lin, Qian Li, Xianqing Wen, Jianxin Li, Jinhu Lu
TL;DR
PS-NET tackles semi-supervised learning for lightweight NLP models under severe label scarcity by combining online distillation from a teacher, mutual learning among multiple small student networks, and curriculum adversarial perturbations to progressively generalize. The framework distills knowledge from a deeper teacher into two-layer or small student networks and trains them jointly on labeled and unlabeled data, reinforced by peer interaction and self-augmentation. Empirical results on SSL text classification and extractive summarization show PS-NET outperforms state-of-the-art lightweight SSL baselines such as FLiText and DisCo, while using far fewer parameters. This approach enables efficient deployment on resource-constrained devices and provides a scalable path for SSL in compact models across tasks.
Abstract
The semi-supervised learning (SSL) strategy in lightweight models requires reducing annotated samples and facilitating cost-effective inference. However, the constraint on model parameters, imposed by the scarcity of training labels, limits the SSL performance. In this paper, we introduce PS-NET, a novel framework tailored for semi-supervised text mining with lightweight models. PS-NET incorporates online distillation to train lightweight student models by imitating the Teacher model. It also integrates an ensemble of student peers that collaboratively instruct each other. Additionally, PS-NET implements a constant adversarial perturbation schema to further self-augmentation by progressive generalizing. Our PS-NET, equipped with a 2-layer distilled BERT, exhibits notable performance enhancements over SOTA lightweight SSL frameworks of FLiText and DisCo in SSL text classification with extremely rare labelled data.
