Recurrent Neural Network for Text Classification with Multi-Task Learning
Pengfei Liu, Xipeng Qiu, Xuanjing Huang
TL;DR
The paper tackles data scarcity in text classification by introducing three RNN-based multi-task architectures that share information across related tasks. It formalizes uniform-, coupled-, and shared-layer sharing with gating mechanisms and trains them jointly, with optional pretraining of the shared layer. Across four datasets, the multi-task models outperform single-task baselines and show competitive gains compared to state-of-the-art neural models, especially when fine-tuned or pre-trained. The work demonstrates that carefully designed cross-task sharing in RNNs can enhance text representations and transfer learning in NLP.
Abstract
Neural network based methods have obtained great progress on a variety of natural language processing tasks. However, in most previous works, the models are learned based on single-task supervised objectives, which often suffer from insufficient training data. In this paper, we use the multi-task learning framework to jointly learn across multiple related tasks. Based on recurrent neural network, we propose three different mechanisms of sharing information to model text with task-specific and shared layers. The entire network is trained jointly on all these tasks. Experiments on four benchmark text classification tasks show that our proposed models can improve the performance of a task with the help of other related tasks.
