Improving Detection of Rare Nodes in Hierarchical Multi-Label Learning
Isaac Xu, Martin Gillis, Ayushi Sharma, Benjamin Misiuk, Craig J. Brown, Thomas Trappenberg
TL;DR
This work targets the persistent challenge of detecting rare, deep hierarchical nodes in hierarchical multi-label learning (HML). It introduces a node-centered loss that combines imbalance weighting with a focal weighting term derived from ensemble uncertainty within a Coherent HML Neural Network (C-HMCNN). Through extensive experiments on FUN/GO gene-product datasets and BenthicNet-E, the approach yields large recall and F$_1$ gains for rare nodes, with notable improvements when using uncertainty-based focal terms such as bBMA and GMU, especially as ensemble size grows. The results demonstrate robustness to suboptimal encoders and limited data, offering a practical, generalizable method for reliable deep-hierarchy predictions across biology and vision domains.
Abstract
In hierarchical multi-label classification, a persistent challenge is enabling model predictions to reach deeper levels of the hierarchy for more detailed or fine-grained classifications. This difficulty partly arises from the natural rarity of certain classes (or hierarchical nodes) and the hierarchical constraint that ensures child nodes are almost always less frequent than their parents. To address this, we propose a weighted loss objective for neural networks that combines node-wise imbalance weighting with focal weighting components, the latter leveraging modern quantification of ensemble uncertainties. By emphasizing rare nodes rather than rare observations (data points), and focusing on uncertain nodes for each model output distribution during training, we observe improvements in recall by up to a factor of five on benchmark datasets, along with statistically significant gains in $F_{1}$ score. We also show our approach aids convolutional networks on challenging tasks, as in situations with suboptimal encoders or limited data.
