LH-Mix: Local Hierarchy Correlation Guided Mixup over Hierarchical Prompt Tuning
Fanshuang Kong, Richong Zhang, Ziqiao Wang
TL;DR
This work tackles hierarchical text classification (HTC) by leveraging a text-specific local hierarchy and introducing LH-Mix, which fuses depth-level hierarchical prompts with Mixup to model latent correlations among sibling labels. Local hierarchies are represented as sequences; a similarity measure based on the local-hierarchy CLS representation guides an adaptive Mixup ratio $oldsymbol{\lambda} = -(eta - 0.5) s^{\alpha} + \beta$, enabling more informative in-between samples. LH-Mix applies Mixup to both inputs and losses across each hierarchy depth, using a zero-bounded multi-label cross-entropy loss and loss mixing to optimize the model. Experiments on WOS, NYT, and RCV1-V2 demonstrate strong, consistent gains over state-of-the-art baselines, particularly in deeper and sparser hierarchies, underscoring the value of local-hierarchy-aware augmentation for HTC.
Abstract
Hierarchical text classification (HTC) aims to assign one or more labels in the hierarchy for each text. Many methods represent this structure as a global hierarchy, leading to redundant graph structures. To address this, incorporating a text-specific local hierarchy is essential. However, existing approaches often model this local hierarchy as a sequence, focusing on explicit parent-child relationships while ignoring implicit correlations among sibling/peer relationships. In this paper, we first integrate local hierarchies into a manual depth-level prompt to capture parent-child relationships. We then apply Mixup to this hierarchical prompt tuning scheme to improve the latent correlation within sibling/peer relationships. Notably, we propose a novel Mixup ratio guided by local hierarchy correlation to effectively capture intrinsic correlations. This Local Hierarchy Mixup (LH-Mix) model demonstrates remarkable performance across three widely-used datasets.
