Enhance and Reuse: A Dual-Mechanism Approach to Boost Deep Forest for Label Distribution Learning
Jia-Le Xu, Shen-Huan Lyu, Yu-Nian Wang, Ning Chen, Zhihao Qu, Bin Tang, Baoliu Ye
TL;DR
This work addresses label distribution learning (LDL) by integrating a dual-mechanism extension to Deep Forest (DF). The Enhanced and Reused Feature Deep Forest (ERDF) introduces a feature enhancement module that explicitly leverages label correlations via PCA-derived relationship patterns and an input-feature mapping, plus a measure-aware feature reuse module that stabilizes training by selectively reusing lower-quality features. Ablation and benchmark studies on five LDL datasets demonstrate that both components are essential, with ERDF achieving superior performance across six metrics and showing robustness to hyperparameters. The approach offers a practical, scalable pathway to exploit label correlations in non-backpropagation-based deep ensembles and suggests broader applicability to other label-dependent tasks.
Abstract
Label distribution learning (LDL) requires the learner to predict the degree of correlation between each sample and each label. To achieve this, a crucial task during learning is to leverage the correlation among labels. Deep Forest (DF) is a deep learning framework based on tree ensembles, whose training phase does not rely on backpropagation. DF performs in-model feature transform using the prediction of each layer and achieves competitive performance on many tasks. However, its exploration in the field of LDL is still in its infancy. The few existing methods that apply DF to the field of LDL do not have effective ways to utilize the correlation among labels. Therefore, we propose a method named Enhanced and Reused Feature Deep Forest (ERDF). It mainly contains two mechanisms: feature enhancement exploiting label correlation and measure-aware feature reuse. The first one is to utilize the correlation among labels to enhance the original features, enabling the samples to acquire more comprehensive information for the task of LDL. The second one performs a reuse operation on the features of samples that perform worse than the previous layer on the validation set, in order to ensure the stability of the training process. This kind of Enhance-Reuse pattern not only enables samples to enrich their features but also validates the effectiveness of their new features and conducts a reuse process to prevent the noise from spreading further. Experiments show that our method outperforms other comparison algorithms on six evaluation metrics.
