GWPT: A Green Word-Embedding-based POS Tagger
Chengwei Wei, Runqi Pang, C. -C. Jay Kuo
TL;DR
GWPT tackles the need for fast, energy-efficient POS tagging by proposing a green learning-based tagger that avoids heavy DL architectures. It employs a three-stage cascade—representation learning from word embeddings with frequency-based dimension partitioning and adaptive N-grams, followed by discriminant feature selection via the discriminant feature test (DFT) and an XGBoost classifier—for POS prediction. A key component is frequency analysis of embedding dimensions to guide N-gram choices and dimensionality reduction, enabling compact representations without sacrificing accuracy. Experiments on PTB and UD show GWPT achieving competitive tagging accuracy with substantially fewer parameters and lower FLOPs than DL-based taggers and MultiBPEmb, making it suitable for edge devices and energy-constrained settings; future work may incorporate character embeddings or lighter classifiers for multi-class POS tagging.
Abstract
As a fundamental tool for natural language processing (NLP), the part-of-speech (POS) tagger assigns the POS label to each word in a sentence. A novel lightweight POS tagger based on word embeddings is proposed and named GWPT (green word-embedding-based POS tagger) in this work. Following the green learning (GL) methodology, GWPT contains three modules in cascade: 1) representation learning, 2) feature learning, and 3) decision learning modules. The main novelty of GWPT lies in representation learning. It uses non-contextual or contextual word embeddings, partitions embedding dimension indices into low-, medium-, and high-frequency sets, and represents them with different N-grams. It is shown by experimental results that GWPT offers state-of-the-art accuracies with fewer model parameters and significantly lower computational complexity in both training and inference as compared with deep-learning-based methods.
