Supervised Gradual Machine Learning for Aspect Category Detection
Murtadha Ahmed, Qun Chen
TL;DR
This paper tackles Aspect Category Detection (ACD) under limited labeled data by introducing a supervised Gradual Machine Learning (GML) framework that couples deep neural networks (DNNs) for semantic relation modeling with a factor-graph based gradual inference mechanism. Semantic relations between sentence pairs are captured via a category-specific DNN and a Bert-based binary model, and are encoded as binary factors in a GML graph to progressively propagate labels from easy to hard instances. The method yields state-of-the-art performance on SemEval 2014/2016, MAMS, and SentiHood, with demonstrated data efficiency (e.g., competitive results at 30% of labeled data) and robustness to the number of relations $k_b$ used in inference. The approach offers a practical path to high-accuracy ACD in settings with scarce labeled data and could be extended to other fine-grained text classification tasks.
Abstract
Aspect Category Detection (ACD) aims to identify implicit and explicit aspects in a given review sentence. The state-of-the-art approaches for ACD use Deep Neural Networks (DNNs) to address the problem as a multi-label classification task. However, learning category-specific representations heavily rely on the amount of labeled examples, which may not readily available in real-world scenarios. In this paper, we propose a novel approach to tackle the ACD task by combining DNNs with Gradual Machine Learning (GML) in a supervised setting. we aim to leverage the strength of DNN in semantic relation modeling, which can facilitate effective knowledge transfer between labeled and unlabeled instances during the gradual inference of GML. To achieve this, we first analyze the learned latent space of the DNN to model the relations, i.e., similar or opposite, between instances. We then represent these relations as binary features in a factor graph to efficiently convey knowledge. Finally, we conduct a comparative study of our proposed solution on real benchmark datasets and demonstrate that the GML approach, in collaboration with DNNs for feature extraction, consistently outperforms pure DNN solutions.
