Deep-Wide Learning Assistance for Insect Pest Classification
Toan Nguyen, Huy Nguyen, Huy Ung, Hieu Ung, Binh Nguyen
TL;DR
DeWi introduces a Deep-Wide learning framework for insect pest classification that alternates between a Deep step (triplet-margin loss with a multi-level feature extractor) and a Wide step (Mixup augmentation) to jointly boost discrimination and generalization. The Deep step forms 8192-d embeddings from dual projectors and optimizes a combined loss $L = \beta_1 L_{CE} + \beta_2 L_T$, where $L_T$ is the batch-hard triplet margin loss with distance $D$ and margin $m$, while the Wide step uses Mixup to generate augmented samples with mixed labels. Empirically, DeWi achieves state-of-the-art results on IP102 (Acc $=76.44\%$, $mF1=69.46$, $GM=65.07$) and D0 (Acc $=99.79\%$), and extensive ablations validate the importance of each component, the chosen margin, and the superiority of the one-stage approach over self-supervised pretraining. The framework is backbone-friendly, efficient, and scalable, with potential extensions to Vision Transformers and additional augmentation and contrastive losses, offering practical value for smart agriculture deployments.
Abstract
Accurate insect pest recognition plays a critical role in agriculture. It is a challenging problem due to the intricate characteristics of insects. In this paper, we present DeWi, novel learning assistance for insect pest classification. With a one-stage and alternating training strategy, DeWi simultaneously improves several Convolutional Neural Networks in two perspectives: discrimination (by optimizing a triplet margin loss in a supervised training manner) and generalization (via data augmentation). From that, DeWi can learn discriminative and in-depth features of insect pests (deep) yet still generalize well to a large number of insect categories (wide). Experimental results show that DeWi achieves the highest performances on two insect pest classification benchmarks (76.44\% accuracy on the IP102 dataset and 99.79\% accuracy on the D0 dataset, respectively). In addition, extensive evaluations and ablation studies are conducted to thoroughly investigate our DeWi and demonstrate its superiority. Our source code is available at https://github.com/toannguyen1904/DeWi.
