HiLight: A Hierarchy-aware Light Global Model with Hierarchical Local ConTrastive Learning
Zhijian Chen, Zhonghua Li, Jianxin Yang, Ye Qi
TL;DR
HiLight addresses the scalability issues of hierarchy-aware HTC by eliminating the structure encoder and instead using Hierarchical Local Contrastive Learning (HiLCL) within a lightweight global model composed of a text encoder and a multi-label head. HiLCL combines Local Contrastive Learning with a Hierarchical Learning schedule (HiLearn) to enforce discriminative, path-consistent behavior among labels, especially at finer granularity. Experiments on WOS and RCV1-v2 show competitive Micro-F1 and Macro-F1 scores while achieving superior parameter efficiency and robustness to collapse, outperforming several structure-encoder baselines on key metrics. The work demonstrates that hierarchical information can be effectively injected through task-driven contrastive learning without increasing model size, offering a scalable alternative for HTC in large taxonomies.
Abstract
Hierarchical text classification (HTC) is a special sub-task of multi-label classification (MLC) whose taxonomy is constructed as a tree and each sample is assigned with at least one path in the tree. Latest HTC models contain three modules: a text encoder, a structure encoder and a multi-label classification head. Specially, the structure encoder is designed to encode the hierarchy of taxonomy. However, the structure encoder has scale problem. As the taxonomy size increases, the learnable parameters of recent HTC works grow rapidly. Recursive regularization is another widely-used method to introduce hierarchical information but it has collapse problem and generally relaxed by assigning with a small weight (ie. 1e-6). In this paper, we propose a Hierarchy-aware Light Global model with Hierarchical local conTrastive learning (HiLight), a lightweight and efficient global model only consisting of a text encoder and a multi-label classification head. We propose a new learning task to introduce the hierarchical information, called Hierarchical Local Contrastive Learning (HiLCL). Extensive experiments are conducted on two benchmark datasets to demonstrate the effectiveness of our model.
