HiCoLoRA: Addressing Context-Prompt Misalignment via Hierarchical Collaborative LoRA for Zero-Shot DST
Shuyu Zhang, Yifan Wei, Xinru Wang, Yanmin Zhu, Yangfan He, Yixuan Weng, Bin Li, Yujie Liu
TL;DR
HiCoLoRA tackles semantic misalignment in zero-shot dialog state tracking by introducing a hierarchical collaborative Low-Rank Adaptation framework that separates domain-agnostic and domain-specific knowledge. It combines UniRep-LoRA for universal context, SemAdapt-LoRA for domain prompts, an Adaptive Linear Fusion to balance them, spectral clustering for domain-slot disentanglement, and SemSVD-Init to preserve pre-trained knowledge, together forming a robust context-prompt alignment mechanism. The approach yields state-of-the-art performance on MultiWOZ and SGD in zs-DST, with strong cross-domain generalization and efficiency comparable to standard LoRA. This work offers a scalable, parameter-efficient strategy for zero-shot TOD deployment, reducing data requirements while maintaining high slot-inference fidelity across diverse domains; future work may further improve rare-slot handling and boundary detection.
Abstract
Zero-shot Dialog State Tracking (zs-DST) is essential for enabling Task-Oriented Dialog Systems (TODs) to generalize to new domains without costly data annotation. A central challenge lies in the semantic misalignment between dynamic dialog contexts and static prompts, leading to inflexible cross-layer coordination, domain interference, and catastrophic forgetting. To tackle this, we propose Hierarchical Collaborative Low-Rank Adaptation (HiCoLoRA), a framework that enhances zero-shot slot inference through robust prompt alignment. It features a hierarchical LoRA architecture for dynamic layer-specific processing (combining lower-layer heuristic grouping and higher-layer full interaction), integrates Spectral Joint Domain-Slot Clustering to identify transferable associations (feeding an Adaptive Linear Fusion Mechanism), and employs Semantic-Enhanced SVD Initialization (SemSVD-Init) to preserve pre-trained knowledge. Experiments on multi-domain datasets MultiWOZ and SGD show that HiCoLoRA outperforms baselines, achieving SOTA in zs-DST. Code is available at https://github.com/carsonz/HiCoLoRA.
